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METHODS AND MEANS FOR MODIFICATION 
OF PLANT CHARACTERISTICS 



TECHNICAL FIELD 

5 

The present invention relates generally to methods and 
materials for use in modifying plant characteristics, 
particularly the vernalization response in plants. 



10 PRIOR ART 



Plants must integrate a wide variety of environmental signals 
in order to maximize their reproductive success. Part of this 
integration must involve perception of the seasons, both to 

^ ensure the plant flowers during the correct season (for which 

it is adapted) and to synchronise its flowering with other 
members of its own species, to increase the chances of cross- 
fertilization. Arabidopsis thaliana serves as a model plant, 
for it exhibits responses to a wide variety of environmental 

20 stimuli that are observed in many species. Amongst other 

stimuli, flowering in naturally occurring strains (ecotypes) of 
Arabidopsis can be promoted by vernalization, a long cold 
treatment that mimics the cold of winter. 



25 Many species of plants that grow in temperate or cooler climes 

have an obligate requirement for vernalization in order to 
flower. Such plants typically germinate in autumn, and over 
winter as vegetative plants, and flower in milder conditions of 
^ spring. Vernalization thus acts as a cue, to allow plants to 

3 0 sense the seasons, and to time their flowering to maximise 

their chance of reproductive success. 

Species for which flowering is important to crop production are 
numerous, essentially all crops which are grown from seed, with 

3 5 important examples being the cereals, rice and maize, probably 

the most agronomically important in warmer climatic zones, and 
wheat, barley, oats and rye in more temperate climates. 
Important seed products are oil seed rape, sugar beet, maize, 
sunflower, soybean and sorghum. Many crops which are harvested 

40 for their roots are, of course, grown annually from seed and 

the production of seed of any kind is very dependent upon the 
ability of the plant to flower, to be pollinated and to set 




seed. In horticulture, control of the timing of flowering is 
important. Horticultural plants whose flowering may be 
controlled include lettuce, endive and vegetable brassicas 
including cabbage, broccoli and cauliflower, and carnations and 
5 geraniums . 



15 



In view of the large number of commercially important crop 
species which have a requirement for vernalization in order tc 



flower, modification of this requirement (e.g. by reducing the 
10 duration of vernalization required, or changing the optimum 

temperature, or abrogating the requirement altogether) would be 
of agronomic interest. 



DISCLOSURE OF THE INVENTION 



The inventors have used a late flowering, vernalization 
responsive mutant of Arabidopsis , the fca mutant, as a 
background in which to isolate mutants that exhibit a reduced 
vernalization response and to identify vrnl alleles which are 

20 responsible for this phenotype . The VRN1 gene is the first 

Arabidopsis flowering time gene to be isolated that is 
apparently exclusive to the vernalization promotion pathway. 
As discussed in more detail below, manipulation of the gene may 
permit the control or modification of the vernalization 

25 response of agronomically important crop species. 

That VRN1 is required for a normal vernalization response is 
clear from the phenotype of the vrnl mutants. Further 
experiments by the inventors indicate that there is a 

30 quantitative aspect to VRN1 activity. This suggests that 

artificially increasing or decreasing the amount of VRNl(e.g., 
through overexpression or antisense suppression) may provide a 
tool to, inter alia fine-tune the kinetics and/or optimal 
temperature of the vernalization response; render plants immune 

35 to the effect of cold on flowering response; or alleviate the 

requirement for cold treatment altogether. In addition to 
quantitative manipulation, a further layer of control could be 
obtained by driving VRN1 sense or anti-sense constructs using 
promoters that are either on all the time (constitutive) ; 

40 inducible upon application of a specific molecule; or which 

"naturally" drive expression only during a certain portion of 
the plant life cycle, e.g., seed maturation or late vegetative 



phase . 

Such methods could be used to improve agronomical ly important 
crop species, for instance as follows: 

5 

(a) Extension of geographic range of elite varieties: If an 
elite cultivar of a crop originates from a geographic area 
where it has adapted to require a certain vernalization period, 
and it is therefore climatically-limited m its range, then 

10 fine-tuning the expression of VRN1 may permit alteration of the 

length and intensity of cold treatment required to achieve an 
optimum flowering time in new geographic areas. Two facts are 
noteworthy in this regard: (1) even modest alterations in 
vernalization response could open up huge new areas of 
^5 cultivation for particular elite varieties (an analogous 

situation to that in which small changes in climatic conditions 
can alter the ecology and character of huge areas of 
landscape), and (2) the commercial success of elite genotypes 
is largely hampered by dependency on specific climatic 

20 conditions found in limited geographic areas. 

(b) Shortening of vernalization period: if a winter crop can be 
sown and left in the ground for a shorter period than usual 
(i.e. a reduced vernalization time, perhaps resulting from 

25 increased- or mis -expression of VRN1) this may reduce the risk 

associated with severe winter weather conditions, as the crops 
are exposed to winter conditions for a shorter time. 

O (c) Extension of vegetative growth: If the crop in question is 

30 one in which the vegetative portions of the plant are the 

desired product (e.g., leaf vegetables, sugar beet), then 
preventing the plant from flowering in response to cold 
temperature (i.e., by rendering it less sensitive to the cold 
by impairing VRN1 function) would prevent diversion of valuable 
35 plant resources from the vegetative tissues to the developing 

reproductive tissues, thereby increasing yield. 

Further experiments indicate that species other than 
Arabidopsis contain genes similar to VRN1 . Additionally, 
40 homologues and/or orthologues and/or paralogues of VRN1 (such 

as RTV1) may also exist in Arabidopsis and other species. 
Based on the disclosure herein, such genes may be isolated 




without undue burden by those skilled in the art and used 
analogously to those disclosed herein. 

These and other aspects of the present invention will now be 
discussed in more detail . 

Thus according to one aspect of the present invention there is 
provided an isolated nucleic acid molecule which is capable of 
"specif i : c a~Tl y ~alt"efi ng t He~v e r n aTTsaTt 1 ofT^ r es^pons"e — of a plant" 
into which the nucleic acid is introduced. 

The alteration in the vernalisation response may be assessed by 
comparison with a plant in which the nucleic acid has not been 
so introduced. 

The vernalization response phenotype of plants may be 
investigated by examining their flowering time in response to 
differing durations of vernalization treatment. In the 
experiments below this was assessed in two ways: (1) as the 
total number of vegetative leaves produced prior to flowering 
(LN) , and (2) as the time in days from the end of the 
vernalization treatment to the appearance of the first floral 
bud (FT) . However any appropriate method known to those 
skilled in the art may be used. 

Apart from the specific change in vernalisation response, it is 
preferred that other characteristics of the plant are 
substantially unchanged by the polypeptide, which is to say 
that the polypeptide acts specifically on this response and not 
more generally on flowering time characteristics or other 
stimuli, such as those mediated by other loci such as the FRI 
locus (Clarke and Dean, 1994, Mol . Gen. Genet. 242, 81-89) or 
the VRN2 locus (Chandler et al . , 1996). 

Preferably the isolated nucleic acid molecule capable of 
specifically altering the vernalisation response of a plant is 
obtainable from the VRN1 locus of a plant, more preferably from 
A . thaliana . 

Nucleic acid according to the present invention may include 
cDNA, RNA, genomic DNA and modified nucleic acids or nucleic 
acid analogs (e.g. peptide nucleic acid). Where a DNA sequence 



35 



40 



is specified, e.g. with reference to a figure, unless context 
requires otherwise the RNA equivalent, with U substituted for T 
where it occurs, is encompassed. Nucleic acid molecules 
according to the present invention may be provided isolated 
and/or purified from their natural environment, in 
substantially pure or homogeneous form, or free or 
substantially free of other nucleic acids of the species of 
origin. Where used herein, the term "isolated" encompasses all 



of these possibilities. The nucleic acid molecules may be 
10 wholly or partially synthetic. In particular they may be 

recombinant in that nucleic acid sequences which are not found 
together in nature (do not run contiguously) have been ligated 
or otherwise combined artificially. Alternatively they may 
have been synthesised directly e.g.. using an automated 
synthesiser. 

Thus in one aspect of the invention, there is disclosed a 
nucleic acid encoding the polypeptide of Fig 7. The VRN1 
polypeptide is 341 amino acids in length and is comprised of at 

20 least three regions. Region 1 (residues 2-94 in Figure 7) and 3 

(residues 239-332) can be aligned to each other, and are 
related to the B3 DNA-binding domain originally found in the 
maize transcription factor VIVIPAROUS1 (VP1; McCarty et al . , 
1991) . Region 2 of VRN1 (residues 95-238) , which lies between 

25 the two B3 domains is not obviously related to any domain of 

known function, nor does it have obvious features of a 
transcriptional activation or repression domain. Nonetheless, 
region 2 does contain several provocative sequence features and 

^) motifs, including a putative nuclear localization signal (NSL) , 

30 two putative PEST regions, and three RXXL motifs also 

associated with rapid protein degradation (Cooper et al . , 
1997) . Interestingly, the second PEST region of VRN1 contains a 
potential protein kinase C (PKC) phosphorylation site (residues 
176-178) . 



One nucleic acid encoding this polypeptide is shown in Fig 7 
from nucleotides 269-1295 inclusive (including stop codon) . 
Other nucleic acids of the invention include those which are 
degeneratively equivalent to this. 

A genomic sequence including the VRN1 locus is shown in Annex 
I. The putative cDNA sequence transcribed from this genomic 



sequence is shown at Fig 7. Although this ORF has been 
designated the VRN1 ORF herein, it will be appreciated by those 
skilled in the art that the discussion hereinafter applies 
equally to any other ORF present in the described sequence 
which has the properties attributed to VRN1 . 

In a further aspect of the present invention there are 
disclosed nucleic acids which are variants of the VRN1 



sequences discussed above 



A variant nucleic acid molecule shares homology with, or is 
identical to, all or part of the sequences discussed above. 

Such variants may be used to alter the vernalisation 
characteristics of a plant, as assessed by the methods 
disclosed herein. For instance a variant nucleic acids may be 
include a sequence encoding a functional polypeptide (e.g. 
which may be a variant of the VRN1 polypeptide and which may 
cross-react with an antibody raised to said polypeptide) . 
Alternatively they may include a sequence which interferes with 
the expression or activity of such a polypeptide (e.g. sense or 
anti-sense suppression of a VRN1 coding sequence) . 

Variants may also be used to isolate or amplify nucleic acids 
which have these properties (e.g. by inclusion of a sequence 
which is hybridisable with a VRN1 sequence.) . 

Generally speaking variants may be: 

(i) Novel, naturally occurring, nucleic acids, isolatable using 
the sequences of the present invention. They may include 
alleles (which will include polymorphisms or mutations at one 
or more bases - for instance vrnl-1 or vrnl-2 shown in Fig 7) 
or pseudoalleles (which may occur at closely linked loci to the 
VRN1 gene) . Also included are paralogues, isogenes, or other 
homologous genes belonging to the same family as the VRN1 gene. 
Although these may occur at different genomic loci to the gene, 
they are likely to share conserved regions with it (see e.g. 
RTV1 in the Examples below) . Also included are homologues of 
VRN1 from other plant species. 



(ii) Artificial nucleic acids, which can be prepared by the 



skilled person in the light of the present disclosure. Such 
derivatives may be prepared, for instance, by site directed or 
random mutagenesis, or by direct synthesis. Preferably the 
variant nucleic acid is generated either directly or indirectly 
5 (e.g. via one or more amplification or replication steps) from 

an original nucleic acid having all or part of the VRN1 
sequence shown in Fig 7 . 



Particularly included are variants which comprise only a 
10 distinctive part or fragment (however produced) corresponding 

to a portion of the sequence provided. The fragments may encode 
particular functional parts of the polypeptide. Alternatively, 
the fragments may have utility in probing for, or amplifying, 
the sequence provided or closely related ones. Suitable 
lengths of fragment, and conditions, for such processes are 
discussed in more detail below. 

Also included are nucleic acids corresponding to those above, 
but which have been extended at the 3' or 5 1 terminus. 

20 

The term 'variant' nucleic acid as used herein encompasses all 
of these possibilities. When used in the context of 
polypeptides or proteins it indicates the encoded expression 
product of the variant nucleic acid. 

25 

Some of the aspects of the present invention relating to 
variants will now be discussed in more detail . 

4) Homology (similarity or identity) may be assessed as set out in 

3 0 the Materials and Methods section in the Examples below. 

Homology may be at the nucleotide sequence and/or encoded amino 
acid sequence level. Preferably, the nucleic acid and/or amino 
acid sequence shares at least about 65%, or 70%, or 80% 
35 identity, most preferably at least about 90%, 95%, 96%, 97%, 

98% or 99% identity. 

Homology may be over the full-length of the relevant sequence 
shown herein, or may be over a part of it, preferably over a 

4 0 contiguous sequence of about or greater than about 20, 25, 30, 

33, 40, 50, 67, 133, 167, 200, 233, 267, 300, or more amino 
acids or codons, compared with Fig 7. 



• # 



Thus a variant polypeptide encoded by a nucleic acid of the 
present invention may include within the sequence shown in Fig 
7, a single amino acid or 2 , 3, 4, 5, 6, 7, 8, or 9 changes, 
about 10, 15, 20, 30, 40 or 50 changes, or greater than about 
50, 60, 70, 80 or 90 changes. 

In a further aspect of the invention there is disclosed a 



"method of producing a derTva"tTve nucleic acid comprising the 
step of modifying any of the sequences disclosed above, 
particularly the coding sequence of Fig 7. 

Changes may be desirable for a number of reasons. For instance 
they may introduce or remove restriction endonuclease sites or 
alter codon usage. 

Alternatively changes to a sequence may produce a derivative by 
way of one or more of addition, insertion, deletion or 
substitution of one or more nucleotides in the nucleic acid, 
leading to the addition, insertion, deletion or substitution of 
one or more amino acids in the encoded polypeptide. 

Such changes may modify sites which are required for post 
translation modification such as cleavage sites in the encoded 
polypeptide; motifs in the encoded polypeptide for 
phosphorylation etc. (e.g. residues 176-178 in Figure 7). 
Leader or other targeting sequences (e.g. membrane or golgi 
locating sequences) may be added to the expressed protein to 
determine its location following expression if it is desired to 
isolate it from a microbial system. 

Other desirable mutations may be random or site directed 
mutagenesis in order to alter the activity (e.g. specificity) 
or stability of the encoded polypeptide. Changes may be by way 
of conservative variation, i.e. substitution of one hydrophobic 
residue such as isoleucine, valine, leucine or methionine for 
another, or the substitution of one polar residue for another, 
such as arginine for lysine, glutamic for aspartic acid, or 
glutamine for asparagine . As is well known to those skilled in 
the art, altering the primary structure of a polypeptide by a 
conservative substitution may not significantly alter the 
activity of that peptide because the side-chain of the amino 



acid which is inserted into the sequence may be able to form 
similar bonds and contacts as the side chain of the amino acid 
which has been substituted out. This is so even when the 
substitution is in a region which is critical in determining 
the peptides conformation. Also included are variants having 
non-conservative substitutions. As is well known to those 
skilled in the art, substitutions to regions of a peptide which 
are not critical in determining its conformation may not 



greatly affect its activity because they do not greatly alter 
10 the peptide's three dimensional structure. In regions which are 

critical in determining the peptides conformation or activity 
such changes may confer advantageous properties on the 
polypeptide. Indeed, changes such as those described above may 
confer slightly advantageous properties on the peptide e.g. 
£5 altered stability or specificity. 

Particular regions, or domains, of VRN1 may have utility in 
their own right. For instance the B3 domains may be used to 
direct gene expression in a precise manner, for instance by the 

20 recognition of specific DNA sequences that represent elements 

in the promoters of their normal target genes. By creating 
fusion proteins, comprising the DNA binding domain (or domains) 
of VRN1 , and a heterologous activation or repression domain 
borrowed from another protein, the expression of VRN1 target 

25 genes could be controlled. This may lead to a precise control 

of the expression of those genes that are normally targets of 
VRN1 . Given that such genes, alone or in combination, 
ultimately control the transition to flowering (usually 
£ following vernalization) , their directed expression in other 

3 0 conditions may provide a useful means to control flowering. 

Furthermore, the use of fusions based on the DNA binding 
domains in conventional SELEX or one -hybrid experiments may be 
used to reveal the target genes or DNA sequences normally bound 
by VRN1 . Thus nucleic acids encoding these domains, or fusion 

3 5 proteins comprising them, form one embodiment of this aspect of 

the present invention. 

In a further aspect of the present invention there is provided 
a method of identifying and/or cloning a VRN1 nucleic acid 

4 0 variant from a plant which method employs a sequence described 

above . 



In one embodiment, nucleotide sequence information provided 



herein may be used in a data-base (e.g. of ESTs, or STSs) 
search to find homologous sequences, such as those which may 
become available in due course, and expression products of 
which can be tested for activity as described below. 

5 

In another embodiment the nucleotide sequence information 
provided herein may be used to design probes and primers for 
probing or amplification. An oligonucleotide for use in 
probing or PGR — may-Jae.-- a bout -3 0 or fewer nucl eot i de s-J-ru le n gfch.--- 

10 (e.g. 18, 21 or 24) . Generally specific primers are upwards cf 

14 nucleotides in length. For optimum specificity and cost 
effectiveness, primers of 16-24 nucleotides in length may be 
preferred. Those skilled in the art are well versed in the 
design of primers for use in processes such as PCR . If 

15 required, probing can be done with entire restriction fragments 

of the gene disclosed herein which may be 100' s or even 1000' s 
of nucleotides in length. Small variations may be introduced 
into the sequence to produce 'consensus' or 'degenerate' 
primers if required. 

20 

Such probes and primers form one aspect of the present 
invention . 

Probing may employ the standard Southern blotting technique. 

25 For instance DNA may be extracted from cells and digested with 

different restriction enzymes. Restriction fragments may then 
be separated by electrophoresis on an agarose gel, before 
denaturation and transfer to a nitrocellulose filter. Labelled 
probe may be hybridised to the single stranded DNA fragments on 

30 the filter and binding determined. DNA for probing may be 

prepared from RNA preparations from cells. Probing may 
optionally be done by means of so-called 'nucleic acid chips' 
(see Marshall 5c Hodgson (1998) Nature Biotechnology 16: 27-31, 
for a review) . 

35 

In one embodiment, a variant in accordance with the present 
invention is obtainable by means of a method which includes: 

(a) providing a preparation of nucleic acid, e.g. from plant 
40 cells. Test nucleic acid may be provided from a cell as genomic 

DNA, cDNA or RNA, or a mixture of any of these, preferably as a 
library in a suitable vector. If genomic DNA is used the probe 
may be used to identify unt ranscribed regions of the gene (e.g. 
promoters etc.), such as are described hereinafter, 
45 (b) providing a nucleic acid molecule which is a probe or 




primer as discussed above, 

(c) contacting nucleic acid in said preparation with said 
nucleic acid molecule under conditions for hybridisation of 
said nucleic acid molecule to any said gene or homologue in 

5 said preparation, and, 

(d) identifying said gene or homologue if present by its 
hybridisation with said nucleic acid molecule. Binding of a 
probe to target nucleic acid (e.g. DNA) may be measured using 

— a n y of a - varie ty of t e chniques — at the dispo s al Q ^-t4^&a--^k44-l^ed. 

10 in the art. For instance, probes may be radioact ively , 

f luorescently or enzymatically labelled. Other methods not 
employing labelling of probe include amplification using PCR 
(see below), RN'ase cleavage and allele specific 
oligonucleotide probing. The identification of successful 
15 hybridisation is followed by isolation of the nucleic acid 

0 which has hybridised, which may involve one or more steps of 

PCR or amplification of a vector in a suitable host. 

Preliminary experiments may be performed by hybridising under 
20 low stringency conditions. For probing, preferred conditions 

are those which are stringent enough for there to be a simple 
pattern with a small number of hybridisations identified as 
positive which can be investigated further. 

25 For example, hybridizations may be performed, according to the 

method of Sambrook et al . (below) using a hybridization 
solution comprising: 5X SSC (wherein V SSC = 0.15 M sodium 
chloride; 0.15 M sodium citrate; pH 7), 5X Denhardt ' s reagent, 
0.5-1.0% SDS, 100 ixq/xnl denatured, fragmented salmon sperm DNA, 

^0 0.05% sodium pyrophosphate and up to 50% f ormamide . 

w Hybridization is carried out at 37-42°C for at least six hours. 

Following hybridization, filters are washed as follows: (1) 5 
minutes at room temperature in 2X SSC and 1% SDS; (2) 15 
minutes at room temperature in 2X SSC and 0.1% SDS; (3) 3 0 

35 minutes - 1 hour at 37°C in IX SSC and 1% SDS; (4) 2 hours at 

42-65°C in IX SSC and 1% SDS, changing the solution every 30 
minutes . 

One common formula for calculating the stringency conditions 
40 required to achieve hybridization between nucleic acid 

molecules of a specified sequence homology is (Sambrook et al . , 
1989) : 

T m = 81.5°C + 16.6Log [Na + ] +0.41 (% G+C) - 0.63 (% f ormamide) 
- 600/#bp in duplex 



45 



As an illustration of the above formula, using [Na+] = [0.368] 
and 50-% formamide, with GC content of 42% and an average probe 
size of 200 bases, the T m is 57°C. The T m of a DNA duplex 
decreases by 1 - 1 . 5°C with every 1% decrease in homology. 
Thus, targets with greater than about 75% sequence identity 
would be observed using a hybridization temperature of 42°C. 
Such a sequence would be considered substantially homologous to 
the nucleic acid sequence of the present invention. 



It is well known in the art to increase stringency of 
hybridisation gradually until only a few positive clones 
remain. Other suitable conditions include, e.g. for detection 
of sequences that are about 80-90% identical, hybridization 
overnight at 42°C in 0.25M Na 2 HP0 . , pH 7 . 2 , 6.5% SDS, 10% 
dextran sulfate and a final wash at 55°C in 0 . IX SSC, 0.1% SDS. 
For detection of sequences that are greater than about 90% 
identical, suitable conditions include hybridization overnight 
at 65°C in 0.25M Na 2 HP0 4 , pH 7.2, 6.5% SDS, 10% dextran sulfate 
and a final wash at 60°C in 0 . IX SSC, 0.1% SDS. 

Thus this aspect of the present invention includes a nucleic 
acid including or consisting essentially of a nucleotide 
sequence of complementary to a nucleotide sequence hybridisable 
with any encoding sequence provided herein. Another way of 
looking at this would be for nucleic acid according to this 
aspect to be hybridisable with a nucleotide sequence 
complementary to any encoding sequence provided herein. 

In a further embodiment, hybridisation of nucleic acid molecule 
to a variant may be determined or identified indirectly, e.g. 
using a nucleic acid amplification reaction, particularly the 
polymerase chain reaction (PCR) . PCR requires the use of two 
primers to specifically amplify target nucleic acid, so 
preferably two nucleic acid molecules with sequences 
characteristic of VRN1 are employed. Using RACE PCR, only one 
such primer may be needed (see "PCR protocols; A Guide to 
Methods and Applications 11 , Eds. Innis et al, Academic Press, 
New York, (1990) ) . 

Preferred primers for amplification of conserved regions of 
VRN1 for use as probes to obtain genomic or cDNA clones may 
include any of those shown in Table 3 . 

For instance primers S63 and S49 may be used to amplify a VRN1 
genomic region including the promoter and 3' end of the gene. 



13 

Primers V7 and V2 amplify the VRN1 cDNA ORF . Primers V6 and 
V15 may be used to distinguish VRN1 and RTV1 . 

Thus a method involving use of PCR in obtaining nucleic acid 
5 according to the present invention may include: 

(a) providing a preparation of plant nucleic acid, e.g. from a 
seed or other appropriate tissue or organ, 

(b) providing a pair of nucleic acid molecule primers useful in 

10 primer according to the present invention as discussed above, 

(c) contacting nucleic acid in said preparation with said 
primers under conditions for performance of PCR, 

(d) performing PCR and determining the presence or absence of 
an amplified PCR product. The presence of an amplified PCR 

15 product may indicate identification of a variant. 

In all cases above, if need be, clones or fragments identified 
in the search can be extended. For instance if it is suspected 
that they are incomplete, the original DNA source (e.g. a clone 
20 library, mRNA preparation etc.) can be revisited to isolate 

missing portions e.g. using sequences, probes or primers based 
on that portion which has already been obtained to identify 
other clones containing overlapping sequence. 

25 If a putative naturally occurring homologous sequence is 

identified, its role in vernalisation can be confirmed, for 
instance by methods analogous to those used in the Examples 
below, or by generating mutants of the gene (e.g. by screening 
the available insert ional -mutant collections) and analyzing 
^0 these for their ability to respond to vernalization, possibly 

™ in the presence and absence of other alleles such as vrnl . 

Alternatively the role can be inferred from mapping vrn mutants 
to see if the homologue lies at or close to an appropriate 
locus . 

35 

In a further embodiment, antibodies raised to a VRN1 
polypeptide or peptide can be used in the identification and/or 
isolation of variant polypeptides, and then their encoding 
genes. Thus, the present invention provides a method of 

40 identifying or isolating VRN1 or variant thereof, comprising 

screening candidate polypeptides with a polypeptide comprising 
the antigen-binding domain of an antibody (for example whole 
antibody or a fragment thereof) which is able to bind said VRN1 
polypeptide or variant thereof, or preferably has binding 

45 specificity for such a polypeptide. Methods of obtaining 
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antibodies are described hereinafter. 

Candidate polypeptides for screening may for instance be the 
products of an expression 1 ibrary created us ing nucleic acid 
5 derived from an plant of interest, or may be the product of a 
purification process from a natural source. A polypeptide 
found to bind the antibody may be isolated and then may be 
subject to amino acid sequencing. Any suitable technique may 
be —used to—sequence the pol y p e p t i d e ei the ir wholly o r partially — 

10 (for instance a fragment of the polypeptide may be sequenced) . 

Amino acid sequence information may be used in obtaining 
nucleic acid encoding the polypeptide, for instance by 
designing one or more oligonucleotides (e.g. a degenerate pool 
of oligonucleotides) for use as probes or primers in 

15 hybridization to candidate nucleic acid. 

This aspect of the invention further includes an isolated 
nucleic acid comprising a sequence which is complementary to 
any of those isolated or obtained as above. 

20 

As used hereinafter, unless the context demands otherwise, the 
term "VRN1" is intended to cover any of the nucleic acids of 
the invention described above, including functional variants. 

25 In one aspect of the present invention, the VRN1 nucleic acid 

described above is in the form of a recombinant and preferably 
repli cable vector . 

"Vector" is defined to include, inter alia, any plasmid, 

3 0 cosmid, phage or Agrobacterium binary vector in double or 

single stranded linear or circular form which may or may not be 
self transmissible or mobilizable, and which can transform a 
prokaryotic or eukaryotic host either by integration into the 
cellular genome or exist extrachromosomally (e.g. autonomous 
35 replicating plasmid with an origin of replication) . 

Generally speaking, those skilled in the art are well able to 
construct vectors and design protocols for recombinant gene 
expression. Suitable vectors can be chosen or constructed, 

4 0 containing appropriate regulatory sequences, including promoter 

sequences, terminator fragments, polyadenylat ion sequences, 
enhancer sequences, marker genes and other sequences as 
appropriate. For further details see, for example, Molecular 
Cloning : a Laboratory Manual: 2nd edition, Sambrook et al, 
45 1989, Cold Spring Harbor Laboratory Press or Current Protocols 
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in Molecular Biology, Second Edition, Ausubel et al . eds . , John 
Wiley 8c Sons , 1992 . 

Specifically included are shuttle vectors by which is meant a 
5 DNA vehicle capable, naturally or by design, of replication in 

two different host organisms, which may be selected from 
actinomycetes and related species, bacteria and eucaryotic 
(e.g. higher plant, mammalian, yeast or fungal cells) . 

10 A vector including nucleic acid according to the present 

invention need not include a promoter or other regulatory 
sequence, particularly if the vector is to be used to introduce 
the nucleic acid into cells for recombination into the genome. 

15 Preferably the nucleic acid in the vector is under the control 

£ of, and operably linked to, an appropriate promoter or other 

regulatory elements for transcription in a host cell such as a 
microbial, e.g. bacterial, or plant cell. The vector may be a 
bi - functional expression vector which functions in multiple 

20 hosts. In the case of genomic DNA, this may contain its own 

promoter or other regulatory elements and in the case of cDNA 
this may be under the control of an appropriate promoter or 
other regulatory elements for expression in the host cell 

25 By "promoter" is meant a sequence of nucleotides from which 

transcription may be initiated of DNA operably linked 
downstream (i.e. in the 3' direction on the sense strand of 
double -stranded DNA) . 

^2.0 "Operably linked" means joined as part of the same nucleic acid 

™ molecule, suitably positioned and oriented for transcription 

to be initiated from the promoter. DNA operably linked to a 
promoter is "under transcriptional initiation regulation" of 
the promoter. 

35 

In a preferred embodiment, the promoter is an inducible 
promoter . 

The term "inducible" as applied to a promoter is well 
40 understood by those skilled in the art. In essence, expression 

under the control of an inducible promoter is "switched on" or 
increased in response to an applied stimulus. The nature of the 
stimulus varies between promoters. Some inducible promoters 
cause little or undetectable levels of expression (or no 
45 expression) in the absence of the appropriate stimulus. Other 




inducible promoters cause detectable constitutive expression in 
the absence of the stimulus. Whatever the level of expression 
is in the absence of the stimulus, expression from any 
inducible promoter is increased in the presence of the correct 
5 stimulus. 

Thus this aspect of the invention provides a gene construct, 
preferably a replicable vector, comprising a promoter 

^cpt i-onal l-y-iflducii)le) opexakly^-^nked^t^x- a nuc l eot i de se quence 

10 provided by the present invention, such as the VRN1 gene or a 

variant thereof . 

Particularly of interest in the present context are nucleic 
acid constructs which operate as plant vectors. Specific 

15 procedures and vectors previously used with wide success upon 

plants are described by Guerineau and Mullineaux (1993) (Plant 
transformation and expression vectors. In: Plant Molecular 
Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific 
Publishers, pp 121-148) . Suitable vectors may include plant 

20 viral-derived vectors (see e.g. EP-A-194809) . 

Suitable promoters which operate in plants include the 
Cauliflower Mosaic Virus 35S (CaMV 35S) . Other examples are 
disclosed at pg 120 of Lindsey & Jones (1989) "Plant 

25 Biotechnology in Agriculture" Pub. OU Press, Milton Keynes, UK. 

The promoter may be selected to include one or more sequence 
motifs or elements conferring developmental and/or tissue- 
specific regulatory control of expression. Inducible plant 
promoters include the ethanol induced promoter of Caddick et al 

30 (1998) Nature Biotechnology 16: 177-180. 

If desired, selectable genetic markers may be included in the 
construct, such as those that confer selectable phenotypes such 
as resistance to antibiotics or herbicides (e.g. kanamycin, 
35 hygromycin, phosphinotricin, chlorsulf uron, methotrexate, 

gentamycin, spectinomycin, imidazolinones and glyphosate) . 

The present invention also provides methods comprising 
introduction of such a construct into a plant cell or a 
40 microbial cell and/or induction of expression of a construct 

within a plant cell, by application of a suitable stimulus e.g. 
an effective exogenous inducer. 



In a further aspect of the invention, there is disclosed a host 
45 cell containing a heterologous construct according to the 
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present invention, especially a plant or a microbial cell. 

The term "heterologous" is used broadly in this aspect to 
indicate that the gene/sequence of nucleotides in question 
(e.g. encoding VRN1) have been introduced into said cells of 
the plant or an ancestor thereof, using genetic engineering, 
i.e. by human intervention. A heterologous gene may replace an 
endogenous equivalent gene, i.e. one which normally performs 
the sam e. or a s imilar function^ or th e i n serted sequence may b e_ 



10 additional to the endogenous gene or other sequence. Nucleic 

acid heterologous to a plant cell may be non-naturally 
occurring in cells of that type, variety or species. Thus the 
heterologous nucleic acid may comprise a coding sequence of or 
derived from a particular type of plant cell or species or 

15 variety of plant, placed within the context of a plant cell of 

^ a different type or species or variety of plant. A further 

possibility is for a nucleic acid sequence to be placed within 
a cell in which it or a homologue is found naturally, but 
wherein the nucleic acid sequence is linked and/or adjacent to 

20 nucleic acid which does not occur naturally within the cell, or 

cells of that type or species or variety of plant, such as 
operably linked to one or more regulatory sequences, such as a 
promoter sequence, for control of expression. 

25 The host cell (e.g. plant cell) is preferably transformed by 

the construct, which is to say that the construct becomes 
established within the cell, altering one or more of the cell's 
characteristics and hence phenotype e.g. with respect to a 
vernalisation response . 

Nucleic acid can be introduced into plant cells using any 
suitable technology, such as a disarmed Ti-plasmid vector 
carried by Agrobacterium exploiting its natural gene transfer 
ability (EP-A-270355 , EP-A- 0116718 , NAR 12(22) 8711 - 87215 

35 1984), particle or microproj ect ile bombardment (US 5100792, EP- 

A-444882, EP-A-434616) microinjection (WO 92/09696, WO 
94/00583, EP 331083, EP 175966, Green et al . (1987) Plant 
Tissue and Cell Culture, Academic Press) , electroporat ion (EP 
290395, WO 8706614 Gelvin Debeyser) other forms of direct DNA 

40 uptake (DE 4005152, WO 9012096, US 4684611), liposome mediated 

DNA uptake (e.g. Freeman et al . Plant Cell Physiol. 29: 1353 
(1984)), or the vortexing method (e.g. Kindle, PNAS U.S.A. 87: 
1228 (1990d) Physical methods for the transformation of plant 
cells are reviewed in Oard, 1991, Biotech. Adv. 9: 1-11. 

45 
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Agrobacterium transformation is widely used by those skilled ii 
the art to transform dicotyledonous species. 

Recently, there has also been substantial progress towards the 
routine production of stable, fertile transgenic plants in 
almost all economically relevant monocot plants (see e.g. Hiei 
et al. (1994) The Plant Journal 6, 271-282)). Microproj ectile 
bombardment, electroporat ion and direct DNA uptake are 
preferred wher e A g robacterium alone is ineffici ent or 



10 ineffective. Alternatively, a combination of different 

techniques may be employed to enhance the efficiency of the 
transformation process, eg bombardment with Agrobacterium 
coated micropart icles (EP-A-4 86234 ) or microproj ectile 

bombardment to induce wounding followed by co-cultivation with 
15 Agrobacterium (EP-A-486233 ) . 

Preferred transformation protocols for brassicas, wheat, barley 
and rice may be found Becker et al . , 1994 and references 
therein. However the skilled person will appreciate that the 
20 particular choice of a transformation technology will be 

determined by its efficiency to transform certain plant species 
as well as the experience and preference of the person 
practising the invention with a particular methodology of 
choice . 

25 

Thus a further aspect of the present invention provides a 
method of transforming a plant cell involving introduction of a 
construct as described above into a plant cell and causing or 
allowing recombination between the vector and the plant cell 
30 genome to introduce a nucleic acid according to the present 

invention into the genome. 

The invention further encompasses a host cell transformed with 
nucleic acid or a vector according to the present invention 

35 (e.g comprising the VRN1 sequence) especially a plant or a 

microbial cell. In the transgenic plant cell (i.e. transgenic 
for the nucleic acid in question) the transgene may be on an 
extra-genomic vector or incorporated, preferably stably, into 
the genome. There may be more than one heterologous nucleotide 

4 0 sequence per haploid genome. 

Generally speaking, following transformation, a plant may be 
regenerated, e.g. from single cells, callus tissue or leaf 
discs, as is standard in the art. Almost any plant can be 
45 entirely regenerated from cells, tissues and organs of the 
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plant. Available techniques are reviewed in Vasil et al . , Cell 
Culture and Somatic Cell Genetics of Plants, Vol J, JJ and III, 
Laboratory Procedures and Their Applications , Academic Press, 
1984, and Weissbach and Weissbach, Methods for Plant Molecular 
5 Biology, Academic Press, 1989. 

The generation of fertile transgenic plants has been achieved 
in the cereals rice, maize, wheat, oat, and barley (reviewed in 
Shlmamoto, K. — (1994) — Currmrt Opinion in BioLechnvlogy 5, 158- 
10 162.; Vasil, et al . (1992) Bio/Technology 10, 667-674; Vain et 

al., 1995, Biotechnology Advances 13 (4): 653-671; Vasil, 1996, 
Nature Biotechnology 14 page 702) . 

Plants which include a plant cell according to the invention 
are also provided. 

In addition to the regenerated plant, the present invention 
embraces all of the following: a clone of such a plant, seed, 
selfed or hybrid progeny and descendants (e.g. Fl and F2 

20 descendants) . The invention also provides a plant propagule 

from such plants, that is any part which may be used in 
reproduction or propagation, sexual or asexual, including 
cuttings, seed and so on. It also provides any part of these 
plants, which in all cases include the plant cell or 

25 heterologous VRN1 DNA described above. 

A plant according to the present invention may be one which 
does not breed true in one or more properties. Plant varieties 
may be excluded, particularly registrable plant varieties 
£) according to Plant Breeders' Rights. 

A further aspect of the present invention provides a method for 
assessing the vernalisation responsiveness of a plant, the 
method comprising the step of determining the presence and/or 

35 identity of a VRN1 allele therein comprising the use of a 

nucleic acid as described above. Such a diagnostic test may be 
used with transgenic or wild-type plants. The use of 
diagnostic tests for alleles allows the researcher or plant 
breeder to establish, with full confidence and independent from 

40 time consuming biochemical tests, whether or not a desired 

allele is present in the plant of interest (or a cell thereof) , 
whether the plant is a representative of a collection of other 
genetically identical plants (e.g. an inbred variety or 
cultivar) or one individual in a sample of related (e.g. 
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breeders' selection) or unrelated plants. 

The method may form part of a plant breeding scheme based on 
selection and selfing of desirable individuals. Reliable 
5 selection for appropriate VRN1 alleles can be made in early 

generations and on more material than would otherwise be 
possible. This gain in reliability of selection plus the time 
saving by being able to test material earlier and without 
- —costly- phe notype s c^^^a ing 1s—g£ — con s id era ble va-l^e— in— ^>l-ant — 
1 0 breeding . 

Nucleic acid-based determination of the presence or absence of 
one or more desirable alleles may be combined with 
determination of the genotype of the flanking linked genomic 

15 DNA and other unlinked genomic DNA using established sets of 

markers such as RFLPs , microsatell i tes or SSRs, AFLPs , RAPDs 
etc. This enables the researcher or plant breeder to select 
for not only the presence of the desirable allele but also for 
individual plant or families of plants which have the most 

20 desirable combinations of linked and unlinked genetic 

background. Such recombinations of desirable material may 
occur only rarely within a given segregating breeding 
population or backcross progeny. Direct assay of the locus as 
afforded by the present invention allows the researcher to make 

25 a step-wise approach to fixing (making homozygous) the desired 

combination of flanking markers and alleles, by first 
identifying individuals fixed for one flanking marker and then 
identifying progeny fixed on the other side of the locus all 
the time knowing with confidence that the desirable allele is 

30 still present . 

The present disclosure provides sufficient information for a 
person skilled in the art to obtain genomic DNA sequence for 
any given new or existing allele and devise a suitable nucleic 
35 acid- and/or polypept ide-based diagnostic assay. In designing 

a nucleic acid assay account is taken of the distinctive 
variation in sequence that characterizes the particular variant 
allele (see e.g. Fig 7 and the allelic variations described 
therein) . 

40 

The invention further provides a method of influencing or 
affecting the vernalisation response in a plant, the method 
including causing or allowing expression of a heterologous VRN1 
nucleic acid sequence as discussed above within the cells of 
45 the plant. The method may include the use of VRN1 nucleic acid 
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in conjunction with other genes affecting vernalisation (e.g. 
VRN2) . As discussed in the Examples below, VRN1 and VRN2 may 
act in separate and partially redundant vernalization-promoting 
pathways . 

5 

The step may be preceded by the earlier step of introduction of 
the VRN1 nucleic acid into a cell of the plant or an ancestor 
thereof. In addition to use of the nucleic acids of the 

present i nvent ion for production of — f unct i onal — V RN1 - 

10 polypeptides (thereby enhancing the vernalisation response) , 

the information disclosed herein may also be used to reduce the 
activity VRN1 activity in cells in which it is desired to do so 
(thereby inhibiting or destroying the vernalisation response) . 

15 For instance down- regulation of expression of a target gene may 

^ be achieved using anti-sense technology. 

In using anti -sense genes or partial gene sequences to 
down-regulate gene expression, a nucleotide sequence is placed 

20 under the control of a promoter in a "reverse orientation" such 

that transcription yields RNA which is complementary to normal 
mRNA transcribed from the "sense" strand of the target gene. 
See, for example, Rothstein et al , 1987; Smith et al, (1988) 
Nature 334, 724-726; Zhang et al, (1992) The Plant Cell 4, 1575- 

25 1588, English et al . , (1996) The Plant Cell 8, 179-188. 

Antisense technology is also reviewed in Bourque, (1995) , Plant 
Science 105, 125-149, and Flavell, (1994) PNAS USA 91, 3490- 
3496 . 

^3 An alternative to anti-sense is to use a copy of all or part of 

the target gene inserted in sense, that is the same, 
orientation as the target gene, to achieve reduction in 
expression of the target gene by co- suppression . See, for 
example, van der Krol et al . , (1990) The Plant Cell 2, 291-299; 
35 Napoli et al . , (1990) The Plant Cell 2, 279-289; Zhang et al . , 

(1992) The Plant Cell 4, 1575-1588, and US-A- 5 , 231 , 020 . 
Further refinements of the gene silencing or co-suppression 
technology may be found in W095/34668 (Biosource) ; Angell & 
Baulcombe (1997) The EMBO Journal 16,12:3675-3684; and Voinnet 
40 Sc Baulcombe (1997) Nature 389: pg 553. 

Further options for down regulation of gene expression include 
the use of ribozymes, e.g. hammerhead ribozymes, which can 
catalyse the site- specif ic cleavage of RNA, such as mRNA (see 
45 e.g. Jaeger (1997) "The new world of ribozymes" Curr Opin 



Struct Biol 7:324-335, or Gibson & Shillitoe ( 1 9 9 7 ) "Ribozymes : 
their functions and strategies form their use" Mol Biotechnol 
7: 242-251.) 

The complete sequence corresponding to the coding sequence (in 
reverse orientation for anti-sense) need not be used. For 
example fragments of sufficient length may be used. It is a 
routine matter for the person skilled in the art to screen 
f ragments -of-V-arious sizes and-..£rQm_variaus parts of the coding 
sequence to optimise the level of anti-sense inhibition. It 
may be advantageous to include the initiating methionine ATG 
codon, and perhaps one or more nucleotides upstream of the 
initiating codon. A further possibility is to target a 
conserved sequence of a gene, e.g. a sequence that is 
characteristic of one or more genes, such as a regulatory 
sequence . 

The sequence employed may be about 500 nucleotides or less, 
possibly about 400 nucleotides, about 300 nucleotides, about 
200 nucleotides, or about 100 nucleotides. It may be possible 
to use oligonucleotides of much shorter lengths, 14-23 
nucleotides, although longer fragments, and generally even 
longer than about 500 nucleotides are preferable where 
possible, such as longer than about 600 nucleotides, than about 
700 nucleotides, than about 800 nucleotides, than about 1000 
nucleotides or more. 

It may be preferable that there is complete sequence identity 
in the sequence used for down-regulation of expression of a 
target sequence, and the target sequence, although total 
complementarity or similarity of sequence is not essential. 
One or more nucleotides may differ in the sequence used from 
the target gene. Thus, a sequence employed in a down- 
regulation of gene expression in accordance with the present 
invention may be a wild- type sequence (e.g. gene) selected from 
those available, or a variant of such a sequence in the terms 
described above. The sequence need not include an open reading 
frame or specify an RNA that would be translatable. 

Thus the present invention further provides the use of a 
variant VRN1 nucleotide sequence, or its complement, for down- 
regulation of gene expression, particularly down- regulation of 
expression of the VRN1 gene or homologue thereof, preferably in 
order to inhibit or suppress the vernalisation response in a 
plant . 
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Anti-sense or sense regulation may itself be regulated by- 
employing an inducible promoter in an appropriate construct. 

The present invention also encompasses the expression product 
of any of the coding VRN1 nucleic acid sequences disclosed and 
methods of making the expression product by expression from 
encoding nucleic acid therefore under suitable conditions, 
which may be in suitable host cells. 



As described in the Examples, several features of VRN1 suggest 
that it is likely to serve as a modulator of transcription 
(e.g., as a "co-activator" or "co-repressor " ) , or in the least 
as a DNA-binding protein. These features include the presence 
of the B3 domains; the homology of a portion of region 2 with 
c-myc, a transcription factor; the presence of a putative NLS, 
and the presence of putative signals for rapid protein 
degradation, which are common in transcription factors and 
other proteins of regulatory function (Chevaillier , 1993 ; 
Vierstra, 1996; Barnes and Gomes, 1995; Rechsteiner and Rogers, 
1996; Gomes and Barnes, 1997) . 

The present invention also provides for the production and use 
of fragments of the full-length polypeptides disclosed herein, 
especially active portions thereof. An "active portion" of a 
polypeptide means a peptide which is less than said full length 
polypeptide, but which retains an essential biological 
activity. In particular, the active portion retains the 
ability to alter vernalization response in a plant, such as 
Arabidopsis thaliana . 

A "fragment" of a polypeptide means a stretch of amino acid 
residues of at least about five to seven contiguous amino 
acids, often at least about seven to nine contiguous amino 
acids, typically at least about nine to 13 contiguous amino 
acids and, most preferably, at least about 20 to 30 or more 
contiguous amino acids. 

Use of recombinant VRN1 protein, or a fragment (e.g the domains 
discussed above) thereof, as a DNA-binding protein, or more 
specifically a modulator of transcription, forms one aspect of 
the invention. 

Fragments of the polypeptides may include one or more epitopes 
useful for raising antibodies to a portion of any of the amino 
acid sequences disclosed herein. Preferred epitopes are those 




to which antibodies are able to bind specifically, which may be 
taken to be binding a polypeptide or fragment thereof of the 
invention with an affinity which is at least about lOOOx that 
of other polypeptides. 

Thus purified VRN1 protein, or a fragment or other variant 
thereof, e.g. produced recombinant ly by expression from 
encoding nucleic acid therefor, may be used to raise antibodies 

employing- tcchn i-queg-wM^ h ar e s ta nda rd in -the-ar-t- Ar^t-i-bodrie-s-- 

and polypeptides comprising antigen-binding fragments of 
antibodies may be used in identifying homologues from other 
plant species as discussed above. 

Methods of producing antibodies include immunising a mammal 
(e.g. mouse, rat, rabbit, horse, goat, sheep or monkey) with 
the protein or a fragment thereof. Antibodies may be obtained 
from immunised animals using any of a variety of techniques 
known in the art, and might be screened, preferably using 
binding of antibody to antigen of interest. 

For instance, Western blotting techniques or 

immunoprecipitat ion may be used (Armitage et al , 1992, Nature 
357: 80-82). Antibodies may be polyclonal or monoclonal. 

Antibodies may be modified in a number of ways. Indeed the 
term "antibody" should be construed as covering any specific 
binding substance having a binding domain with the required 
specificity. Thus, this term covers antibody fragments, 
derivatives, functional equivalents and homologues of 
antibodies, including any polypeptide comprising an 
immunoglobulin binding domain, whether natural or synthetic. 

As an alternative or supplement to immunising a mammal, 
antibodies with appropriate binding specificity may be obtained 
from a recombinant ly produced library of expressed 
immunoglobulin variable domains, e.g. using lambda 
bacteriophage or filamentous bacteriophage which display 
functional immunoglobulin binding domains on their surfaces; 
for instance see WO92/01047. 

Specific binding members such as antibodies and polypeptides 
comprising antigen binding domains of antibodies that bind and 
are preferably specific for a VRN1 polypeptide or variant 
thereof represent further aspects of the present invention, as 
do their use and methods which employ them. 
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The above description has generally been concerned with the 
coding parts of the VRN1 gene and variants and products 
thereof. Also embraced within the present invention are 
untranscribed parts of the gene. 

Thus a further aspect of the invention is a nucleic acid 
molecule encoding the promoter of the VRN1 gene, which is 

bel i ev-ed £,0 be present — in the — sequence sho wn in Ann e x — I (whi ch. 

begins at the end of the LARS1 gene) . 

As described in the Examples below, The VRN1 promoter region 
and VRN1 intron 1 were found to contain a variety of potential 
binding sites including low temperature response elements ; 
binding sites for the Arabidopsis dehydration- and ABA- 
responsive gene rd22; one binding site for Arabidopsis Myb2 , a 
transcription factor involved in regulation of genes responsive 
to water stress; H-box and TCA-1 binding sites (that may be 
induced by wounding and abiotic stress) ; and ICE-boxes (a 
consensus promoter element found in several cold- inducible 
genes) . 

These control elements are likely to dictate the conditions in 
which expression of the VRN1 transcript is obtained. For 
example, VRN1 may perhaps be induced by cold and/or drought 
treatment, or simply by application of ABA, and use of the 
promoter or a part thereof for induction of transcription under 
any of these conditions forms one aspect of the present 
invention 

Analysis of the upstream region will reveal control regions for 
gene expression including control regions common to many genes 
(i.e TATA and CAAT boxes) and other control regions, usually 
located from 1 to 10,000, such as 1 to 1000 or 50 to 500 
nucleotides upstream of the start of transcription. To find 
minimal elements or motifs responsible for regulation, 
restriction enzyme or nucleases may be used to digest a nucleic 
acid molecule, or mutagenesis may be employed, followed by an 
appropriate assay (for example using a reporter gene such as 
luciferase) to determine promoter activity. The control region 
may also be mutated to identify specific subregions responsible 
for transcriptional control. This may be achieved by a number 
of techniques well known in the art as such, including DNase 
protection footprint assays, in which the control region is 
brought into contact with an extract from a cell in which the 




VRN1 gene is actively expressed, and the regions of the control 
region which bind factors in that extract is determined. 

Nucleic acid comprising these elements or motifs forms one pare 
of the present invention. 



"Promoter activity" is used to refer to ability to initiate 
transcription under appropriate conditions e.g. optionally in 

•the-prese nce of an indu c er -^ The lev e l o f— pr-omo^^jr— a ct ivi ty -Ls— 

quantifiable for instance by assessment of the amount of mRNA 
produced by transcription from the promoter or by assessment of 
the amount of protein product produced by translation of mRNA 
produced by transcription from the promoter. The amount of a 
specific mRNA present in an expression system may be determined 
for example using specific oligonucleotides which are able to 
hybridise with the mRNA and which are labelled or may be used 
in a specific amplification reaction such as the polymerase 
chain reaction. 



Those skilled in the art are well aware of a multitude of 
possible reporter genes and assay techniques which may be used 
to determine promoter activity. Any suitable reporter/assay 
may be used and it should be appreciated that no particular 
choice is essential to or a limitation of the present 
invention. Also provided is a nucleic acid construct, 
preferably an expression vector, including the VRN1 promoter 
(or active fragment or variant thereof able to promote 
transcription) operably linked to a heterologous gene, e.g. a 
coding sequence, which is preferably not the coding sequence 
with which the promoter is operably linked in nature. 

The invention will now be further described with reference to 
the following non-limiting Figures and Examples. Other 

embodiments of the invention will occur to those skilled in the 
art in the light of these. 



FIGURES & SEQUENCE ANNEXES 



Fig 1: Vernalization phenotype of vrnl mutant under LDs and 
SDs; vernalization phenotype of vrnl-1 allele compared to vrnl- 
2 allele. 



Fig 2: Genetic map of the position of VRN1 on chromosome III in 
relation to markers used for mapping. The markers (shown on 
right) were scored on a population of 494 F2 plants from a 




cross between vrnl-1 facl x fca-10. The distance in cM between 
each marker is shown on the left. 

Fig 3: Physical map of the region containing VRN1 . 

Fig 4: Complementation of the vrnl-1 mutant phenotype by 
cosmids 8H8 and 10F10. Following vernalization, fca-1 plants 
flower early and vrnl-1 fca-1 plants flower late. 
- Representative— T-2— -l-i-Ree- - in which cosmid— &H£ -or— 1-QF-4-Q- ha s-4aeen — 
transformed in vrnl-1 fca-1 plants show the expected ratio 
(approx. 3:1) of early- to- late flowering plants. 

Fig 5: Sequenced region, and predicted ORFs in the vicinity of 
VRN1 . Overlap between cosmids was initially determined by Xbal 
+ Xhol digestion and Southern blotting. Sequencing of cosmid 
DNA confirmed these results and revealed the complementing 
region as 6565 bp. 0RF1 was subsequently shown to be VRN1 . 

Fig 6: Structure of the VRN1 gene and transcript, and positions 
of the vrnl-1 and vrnl-2 mutations. 

Fig 7: The putative VRN1 transcript and its deduced amino acid 
sequence . 

Fig 8: Alignment of VRN1 and RTV1 

Fig 9: Southern blot of genomic DNA from various cereals probed 
with VRN1 cDNA 

Annex I: this shows contig 29 [bp 1501-6500]) derived from Ler 
VRN1 genomic DNA. The VRN1 promoter is present in the region 
between about nucleotides 1 to 1879. 

EXAMPLES 

Example 1 - Isolation of the vrnl mutants 

The vrnl mutation was selected from mutagenized populations of 
Arabidopsis thaliana (L.) Heynh (Landsberg erecta ecotype) 
plants on the basis of its impairment of the acceleration of 
flowering following a six week cold treatment (vernalization) . 
Mutants were subsequently analyzed for flowering time in the 
absence of vernalization in order to confirm that the induced 
defect was specific to the process of vernalization and not due 



to a general late-flowering mutation (Chandler et al . , 1996) 



Two recessive alleles of vrnl have been identified: (1) vrnl-1 
was isolated by mutagenising fca-1 seeds with EMS, as described 
(Chandler et al . , 1996), and (2) vrnl-2 was isolated by 
mutagenising fca-1 seeds with gamma irradiation. The vrnl-1 
fca-1 line used here was backcrossed to fca-1 two times prior 
to genetic mapping. Subsequently, vrnl-1 fca-1 was further 
4>aekeTOaae4^e^^a^H^^-iffles-ifr-tetal)-^ nd vrnl - 2 fca 1 was 
backcrossed two times in total. 

Example 2 - Characterization of the vrnl phenotype 

The vernalization dose - response phenotype of vrnl mutant plants 
was investigated by examining their flowering time in response 
to differing durations of vernalization treatment. Flowering 
time was measured in two ways: (1) as the total number of 
vegetative leaves produced prior to flowering (LN) , and (2) as 
the time in days from the end of the vernalization treatment to 
the appearance of the first floral bud (FT) . In all experiments 
these two measures were positively correlated, so only LN is 
given in order to more easily facilitate comparison between 
experiments . 

Two types of experiment were conducted: (1) a dose-response 
analysis of vrnl-1 fca-1 and vrnl-2 fca-1 examined under long 
day (LD) growth conditions (Figure 1A) , and (2) the effect of 6 
weeks of vernalization on vrnl-1 in the absence of fca-1 
examined under short day (SD) growth conditions (Figure IB) . In 
the LD experiment shown in Figure 1A, with no vernalization (0 
weeks), both vrnl-1 fca-1 and vrnl-2 fca-1 mutant plants 
flowered very slightly earlier compared to the parental fca-1 
controls, although in other experiments vrnl-1 fca-1 and vrnl-2 
fca-1 mutant plants flowered at approximately the same time as 
fca-1 with no vernalization. In contrast, following 
vernalization, fca-1 plants showed a marked reduction in leaf 
number (=66% after 6 weeks of vernalization) , while vrnl-1 fca- 
1 and vrnl-2 fca-1 mutant plants showed a much reduced response 
(=14% and =27% after 6 weeks of vernalization, respectively) . 
Therefore, both alleles of vrnl are dramatically impaired in 
their response to vernalization, with vrnl-1 being more severe 
than vrnl-2. 

In the SD experiment shown in Figure IB, the wild type Ler 



plants exhibited a = 49% reduction in leaf number after a 
vernalization treatment of six weeks compared to unvernalized 
plants. However, vrnl-1 mutant plants showed only a =18% 
reduction under the same conditions. In addition, this 
experiment shows that the phenotype of vrnl-1 does not depend 
on the presence of the fca-1 mutation or on long day 
photoperiods . vrnl-1 was also combined with other late 
flowering, vernalization-responsive mutations (fVe-2, ld-3, 

fwa. = l-j—£e-=-± , - fpa. -2 , and fL-1 ) — and wa s shown Lc j — impair — tire 

vernalization response of these mutants as well (Chandler et 
al . , 1996) . 

Example 3 - Genetic mapping of VRN1 

The VRN1 gene was initially mapped to the top arm of chromosome 
III, between RLFP markers mi207 and mi339, using a relatively 
small F2 population (77 plants) derived from a cross between 
\rrnl-l fca-1 and fca-10, as described (Chandler et al . , 1996). 
A larger population (494 F2 plants) derived from the same cross 
was then used to finely map the position of VRN1 (Figure 2) . 
The dearth of available genetic markers in this region 
necessitated the development of several new genetic markers 
that were polymorphic between the Ler and Ws ecotypes (Table 
1) . As a first step, two markers flanking VRN1 , ATHCHIB (SSLP) 
and g4711 (CAPS) were used to screen the population for 
recombinants in this = 18 cM interval. Approximately 170 
recombinant chromosomes were identified. Next, the markers 
indicated in Figure 2 were used with these recombinants to 
define the position of VRN1 to the =0.5 cM interval between 
mi339 (2 recombinants to the north) and pKS1240 (one 
recombinant to the south). The CAPS marker agpl4, corresponding 
to a dioxygenase gene, was genetically inseparable from VRN1 
(Figure 2) . 

Example 4 - Physical mapping of VRN1 

The interval between mi339 and pKS1240 fell in a gap between 
Contig 3 and Contig 4 of the CIC YAC coverage of chromosome III 
(Camilleri et al . , 1998) so therefore no physical map data was 
available. Initially, an attempt was made to fill the gap using 
YAC clones other than those derived from the CIC library (i.e., 
yUP, EW, and EG) , but this genomic region was apparently not 
represented in any of these libraries. Therefore, a physical 
map of the interval was constructed using IGF (Mozo et al . , 
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1998) and TAMU (http : //genome- 
www . Stanford . edu/Arabidopsis/ww/Vol2 /choi . html ) BAC clones. 
Marker mi339 was used to screen the BAC libraries and to 
initiate a walk towards pKS1240. BAC contigs (Figure 3) were 
assembled by hybridizing BACs to end probes developed by iPCR 

(Table 2) and by using publicly available BAC end sequence data 

(from TIGR; 

http : / /www . t igr . org/tdb/at/atgenome/bac_end_search/bac_end_sear 
^twhfcmU-as- the b a sis for d esignin g n Ugonuclcot i dc primer s f or 



PCR (Table 2) . The size of individual BAC clones was determined 
by hexagonal pulsed-field gel electrophoresis (Maule, 1997) . 
The =0.5 cM interval between mi339 and pKS1240 containing the 
VRN1 gene was therefore found to correspond to =12 0 kb of 
genomic DNA. 

In preparation for cosmid complementation experiments, a Ler 
genomic library in the cosmid 04541 binary vector (Macknight et 
al., 1997) was initially screened using the following probes: 
BAc't24F13, mi339 , agpl4 , and pKS1240. Putative positive 
clones were verified on Southern blots and the overlap between 
individual cosmids determined by either hybridization with DNA 
probes or with PCR primers designed from BAC- and cosmid- end 
sequence data (Table 2) . The insert sizes of individual cosmid 
clones was determined by digestion with Xbal + Xhol followed by 
standard agarose gel electrophoresis using lambda DNA cut with 
Hindlll as a standard. A complete cosmid contig was generated 
over the =12 0 kb region (Figure 3) . 

Example 5 - Cosmid complementa t ion of the vml phenotype 

Eight cosmids (39K3, 8H8, 10F10, 42A10, 2P5, 19D3, 27J7, 67N6) 
centered around the marker agpl4 were transformed into vrnl-2 
fca-1 plants by Agrobacterium tumefaciens- infection of root 
tissue (Hooykaas, 1989) . In order to test if any of these 
cosmids rescued the mutant phenotype of vrnl-1 fca-1, T2 seed 
(from individual Tl kanamycin resistant transf ormants) was sown 
on soil and vernalized for 5 weeks. Seedlings were then 
transferred to LD conditions, and pricked out into individual 
compartments of divided trays after about a week of growth. The 
total leaf number prior to flowering was determined, and 
cosmids were scored as complementing if the segregation ratio 
of early to late plants (compared to fca-1 and vrnl-1 fca-1 
controls) was approximately 3 : 1 or greater. Eight independent 
lines containing cosmid 8H8, eight independent lines containing 
cosmid 10F10, and three independent lines containing cosmid 




39K3 were found to rescue mutant phenotype of vrnl-l fca-1. 
Lines containing the other five cosmids did not complement the 
vrnl-1 phenotype (Figure 3) . Analysis of the flowering time 
segregation in typical 8H8 and 10F10 complementing lines is 
5 shown in Figure 4. The presence of each cosmid in complementing 

lines (T2 plants) was confirmed by a cosmid- specif ic diagnostic 



PCR, comprising an insert specific primer 8H8DIAG1 
-( AeereeTTeTQe CAACCOCTC ) — and 1 QF 10DIA G1 



10 (AGTTCGCTCTTGCTGTTTTTTTTCCC) (corresponding to a portion of the 

Ler genomic DNA) and a primer BACT 7U (CCTCTTCGCTATTAGGCCAG) 
present in the cosmid vector (see "cosmid complementation" 
under "materials and methods" below) . 

15 Example 6 - Analysis of genomic DNA corresponding to the 

complementing region 

(a) Sequencing of cosmid DNA 

20 The region of chromosome III corresponding to the cosmid contig 

surrounding VRN1 (Figure 3) had apparently not previously been 
sequenced. Therefore the insert DNA from cosmids 8H8 , 10F10, 
and 39K3 (derived from Ler genomic DNA) was sequenced by a 
combination of primer walking and shotgun strategies (Table 3), 

25 resulting in three contigs of sequence (Figure 5) . The total 

amount of new Arabidopsis genomic sequence obtained was 2 0 950 
bp . 

(B) Identification of candidate ORFs in genomic sequence 

m 

w As new genomic sequence data was obtained it was analyzed m 

severa l ways in order to identify potential open reading frames 
(ORFs) and genes. Firstly, homology searches were carried out 
using the BLAST and FASTA computer programs available from the 
35 Arabidopsis thaliana Database (AtDB; ht tp :/ /genome - 

www.stanford.edu/Arabidopsis/seqtools.html) and National Center 
for Biotechnology Information (NCBI; 
http: //www.ncbi .nlm.nih.gov/ 

BLAST/) . Using these programs, genomic sequence in the VRN1 
4 0 region was compared (1) the Arabidopsis EST database, and (2) 

the database of all non-redundant Genbank sequences. Secondly, 
searches were carried out using the NetPlantGene 
(http : //www. cbs . dtu. dk/ 
NetPlantGene.html), BCM Gene Finder 
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(http : / /dot . imgen.bcm.tmc . edu : 93 31 /gene- f inder/gf . html) , and 
GENESCAN (http : //gnomic . Stanford . edu/GENSCANW . html ) computer 
programs which are designed to recognize features of eukaryotic 
genes, such as intron-exon boundaries, ORFs and polyadenylat ion 
signals. The results of these analyses are summarized in Figure 
5 ("Predicted ORFs"). The sequenced region (contigs 29, 2, and 
4) was found to contain = 8 potential genes. Three of these, 
agpl4, LARS1 , and ORF1 (later identified as VRN1) were 
■^ep^esenfeed-^y-^STs- i-nr—the— GenBa nk EST -d:ata&aH&e-r~ 



10 

Example 7 - Identification of the v rnl-1 and vrnl-2 mutations 
and determination of the VRN1 gene structure 

(A) Finding mutations in vrnl mutant plants 

The three cosmids which rescued the vrnl-1 fca-1 mutant 
phenotype (8H8, 10F10, 39K3) were subjected to restriction 
analysis using Xbal and Xhol (Figure 5) and the region of 
overlap between these three cosmids found to be =6.5 kb. The 
ORF analysis indicated that this 6 . 5 kb interval contained the 
3' end of the LARS1 gene (a dioxygenase closely related to 
agpl4), the 3' end of a hypothetical gene of unknown function, 
and the entire structure of another gene, "ORF1" (Figure 5) . 

2 5 In order to determine if either LARS1 or ORF1 corresponded to 

VRN1, a search for the presence of mutations in these genes in 
vrnl-1 fca-1 and vrnl-2 fca-1 mutant plants was carried out. 
PCR primers initially used in the sequencing of cosmid DNA 
(Table 3) were now used to amplify products from vrnl-1 fca-1 
and vrnl-2 fca-1 genomic DNA. Overlapping products that 
encompassed the entire predicted ORF of LARS1 and ORF1 were 
sequenced on both strands and compared to the Ler-derived 
cosmid sequence for the presence of differences corresponding 
to mutations. No mutations were found in the LARS1 gene, but in 
35 ORF1, a lbp nonsense mutation was found in vrnl-l f ca-l-derived 

DNA and a lbp deletion was found in vrnl-2 f ca-l-derived DNA 
(Figure 6) . Each of these putative mutations were then 
confirmed by sequencing four more independent PCR products on 
both strands. The effect of the vrnl-l and vrnl-2 mutations on 
the encoded VRN1 protein is described in Example 8. 



(B) Determining the structure of the VRN1 gene 



The structure of the VRN1 gene and putative transcript was 
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determined by a combination of (1) RT-PCR analysis, (2) 3 1 -RACE 
analysis, and (3) analysis of partial cDNA clones represented 
in the GenBank Arabidopsis EST database (Table 4) . These 
techniques revealed the sequence of the VRN1 transcript and by 
5 comparing this sequence with the VRN1 genomic sequence, the 

intron/exon boundaries were determined (Figure 6, Figure 7). 
The results obtained by these approaches were all in agreement, 
i.e., the intron-exon boundaries and point of polyadenylat ion 

--de t e rm ined—by— R T - PCR and— 3 1 — RACE w o r e id e ntical to tho se — 

10 determined through analysis of EST clones corresponding to VRN1 

cDNA, although the 5' transcription start site of the VRN1 gene 
was not definitively determined by the experiments. Within the 
putative VRN1 transcript (Figure 7) , the Ler-derived sequence 
obtained by RT-PCR and the Columbia -derived sequence obtained 
^5 by the sequencing of EST Clones were 100% identical. 

The VRN1 gene is comprised of 5 exons and covers =3.0 kb of 
genomic DNA from the putative transcription start to the point 
of polyadenylation (see Annex I) . Introns 2, 3, and 4 are a 
20 typical size for an Arabidopsis gene (=100 bp), while intron 1 

is quite large: =1.2 kb (Figure 6) . The 5- and 3-UTR of the 
VRN1 transcript are also somewhat larger than average: =270 and 
=200 bp, respectively (Figures 6, 7). 

2 5 The VRN1 promoter region (from end of LARS1 gene to VRN1 

translation initiation codon) and VRN1 intron 1 were analyzed 
for binding sites of known plant transcription factors and 
known promoter elements using the Web Signal Scan program and 
PLACE database 

^0 (http : //www . dna . af f rc . go . jp/htdocs/ PLACE/ signal scan . html) . 

These regions of VRN1 , which may potentially specify the 
expression of the VRN1 gene, were found to contain the 
following potential binding sites: (1) two low temperature 
response elements (LTREs ; also known as CRT/DREs) , found in 

35 several cold-induced genes from Arabidopsis , Brassica napus, 

and barley and bound by the transcription factor CBF1 (Baker et 
al . , 1994; Stockinger et al . , 1997; Jiang et al . , 1996; Nordin 
et al . , 1993), (2) three binding sites for the Arabidopsis 
dehydration- and ABA- responsive gene rd22 (Abe et al . , 1997), 

40 (3) one binding site for Arabidopsis Myb2 , a transcription 

factor involved in regulation of genes responsive to water 
stress (Urao et al . , 1993), (4) one H-box and three TCA-1 
binding sites, promoter elements found in several tobacco, 
barley, and bean (P. vulgaris) genes that are induced by 




wounding and abiotic stress (Loake et al . , 1992 / Mhiri et al . , 
1997; Goldsbrough et al . , 1993), and (5) three ICE-boxes, a 
consensus promoter element found in several cold- inducible 
genes (G.J. Warren, unpublished) . It is interesting that 
5 control elements for both cold- and drought - inducible genes are 

present within the VRN1 promoter and intron 1, as these 
conditions are known to induce several genes involved in 
acclimation to freezing temperatures (Thomashow, 1994) , and ABA 

-signaling -is — i-iwel-ve^ — fGi lmour and Thomashow-, — 1991 ) . — Th e s e 

10 control elements are likely to dictate the conditions in which 

expression of the VRN1 transcript is obtained. For example, 
VRN1 may perhaps be induced by cold and/or drought treatment, 
or simply by application of ABA. 



15 Taken together, the presence of mutations within ORF1 (the only 

predicted gene that was completely contained within the 
complementing region) in genomic DNA derived from vrnl-l and 
vrnl-2 mutant plants confirmed that ORF1 corresponds to the 
VRN1 gene . 

20 

This may be readily confirmed by introduction of the ORF (in 
sense and antisense) into Arabidopsis (cf . Example 5 above) . 
Constructs may be based on the pGreen0029 vector which drives 
expression of the cloned in gene with a double 35S promoter and 
25 terminator derived from CaMV. This vector, and how to obtain 

it, is discussed in detail in WO 99/27120 (Plant Bioscience 
Limited) . 

(1) Genomic sense construct: the unspliced (genomic) VRN1 ORF 
30 in the sense orientation in order to produce high levels of 

functional VRN1 product. This construct will be put into vrnl-1 
fca-1 and vrnl -2 fca-1 plants. 

(2) cDNA sense construct: the spliced (cDNA) VRN1 ORF in the 

35 sense orientation in order to produce high levels of functional 

VRN1 product. This construct will be put into vrnl~l fca-1 and 
vrnl-2 fca-1 plants. 

(3) cDNA antisense construct: the spliced (cDNA) VRN1 ORF in 
40 the antisense orientation in order to repress the normal 

expression of VRN1 and diminish the amount of functional VRN1 
product. This construct will be put into fca-1 and Ler plants. 



As an alternative to a constitutive promoter, it may be 
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desirable to use an inducible promoter, such as one which is 
controlled by application of the molecule dexamethasone. 

Example 8 - Analysis of the putative protein encoded by the 
VRN1 gene 

The deduced amino acid sequence of VRN1 (Figure 7) was compared 
with the entire GenBank database (NCBI) using the BLASTP and 
TBLASTN p r ogr am s . 



(A) VRN1: domain structure, sequence features, and similarity 
to other known and hypothetical sequences 

The VRN1 gene encodes a putative protein of 341aa (calculated 
15 MW-39278 Da) that is basic (pl=9.1), and is comprised of at 

W least three regions. Region 1 (residues 2-94 in Figure 7) and 3 

(residues 239-332) which are homologous to each other and are 
related to the B3 DNA-binding domain originally found in the 
maize transcription factor VIVIPAROUS1 (VP1; McCarty et al . , 
20 1991) . Domains similar to the B3 domain of VP1 have 

subsequently been found in several Arabidopsis transcription 
factors or putative transcription factors such as ABI3 (the 
Arabidopsis orthologue of maize VP1, (Giraudat et al . , 1992), 
auxin response factors (Ulmasov et al . , 1997), IAA response 
25 factors (Kim et al . , 1997; Abel et al . , 1994; Guilfoyle et al . , 

1998) , FUSCA3 (Luerssen et al . , 1998), and RAVs (Kagaya et al . , 

1999) . Several of these proteins have been shown to bind DNA in 
sequence-specific manner via their B3 domain (e.g., Kagaya et 
al., 1999; Suzuki et al . , 1997; Ulmasov et al . , 1997). 



The B3 DNA-binding domain appears to be specific to plants 
(Suzuki et al . , 1997), and analysis of translated nucleotide 
sequences (i.e., hypothetical proteins) in the GenBank 
databases has revealed at least 22 Arabidopsis sequences that 
35 contain B3 domains, as well as EST sequences from several other 

plant species such as Brassica oleracea , hybrid aspen (Populus 
tremula x P. tremuloides) , and tomato. While VRN1 contains two 
B3 domains, most characterized and hypothetical amino acid 
sequences were found to contain only one B3 domain, and some 
4 0 were found to contain more than two. The B3 domain appears to 

be "defined" by a number of conserved positions (results not 
shown) rather than sequence identity over the whole domain. 
Therefore, BLAST scores between the sequences shown tested are 
only marginally significant (on the order of 10" 6 to 10"* 1 ) . 
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The C-terminal portion of the B3 domain is more conserved than 
the N-terminal portion. 

Phylogenetic analysis of B3 domains (results not shown) using 
5 the Clustal method suggests that Arabidopsis B3 - containing 

proteins fall into several groups: (1) ABI3- and FUSCA3-like 
B3s, (2) auxin response factor- (ARFs) and IAA- inducible 
protein-like B3s, (3) RAVl-like B3s, and (4) at least four 

unoharaoter-i-z-ed - g-roups-7 - which i nc 1 ud e t h e -VRN-1--1 ike B 3 — I-fc — i-s — 

10 likely that through evolution the B3 domain has been recruited 

in different ways by proteins involved in diverse plant 
processes . 

Region 2 of VRN1 (residues 95-238) , which lies between the tv:c 

15 B3 domains (Figure 7), is not obviously related to any domain 

of known function, nor does it have obvious features of a 
transcriptional activation or repression domain. Nonetheless, 
region 2 does contain several sequence features and motifs of 
interest, including a putative nuclear localization signal 

20 (NSL) , two putative PEST regions (identified using the PEST 

Sequence Utility (http : //www . lif . icnet . uk/LRITu/pro j ects/pest /) 
based on Rechsteiner and Rogers, 1996; Rogers et al . , 1986), 
and three RXXL motifs also associated with rapid protein 
degradation (Cooper et al . , 1997) (Figure 7). Interestingly, 

25 the second PEST region of VRN1 contains a potential protein 

kinase C (PKC) phosphorylation site (residues 176-178 in Figure 
7) . There are several examples in the literature for regulation 
of the cellular "lifespan" of proteins by phosphorylation of 
PEST regions (e.g., McKinsey et al . , 1997; Koepp et al . , 1999; 

30 Yaglom et al . , 1996; Marchal et al . , 1998; Liu et al . , 1997). 

For example, in the case of IkB, stimulation of cell surface 
receptors by cytokines initiates a signal transduction cascade 
that phosphorylates IkB at two specific serine residues in the 
PEST region, triggering the polyubiquit inat ion of nearby lysine 

35 residues and ultimately proteolysis (McKinsey et al . , 1997; 

Laney and Hochstrasser , 1999) . 

Analysis of the physiochemical characteristics of VRN1 suggest 
that the two B3 domains are basic (average pl=9.5) and slightly 

40 hydrophobic in character, while region 2 is slightly acidic 

(pl~6.3) and somewhat hydrophilic and therefore likely to be on 
the surface of the molecule and exposed to the aqueous 
environment. Interestingly, unlike the B3 domains which appear 
to be specific to plants, BLAST searches against GenBank (NCBI) 

45 with region 2 of VRN1 picked up no significant hits from plants 
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(except for RTVl , see below) but did reveal weak homology 
between the N-terminal portion of region 2 (residues 109-167) 
and a region of the vertebrate proto-oncogene transcription 
factor c-MYC (Schmidt, 1999) . Furthermore, this region of c-myc 
lies in the linker between the DNA-binding domain and the 
transcriptional activation domain (Kerkhoff and Bister, 1991; 
Classon et al . , 1993) and is not required for the oncogenic 
transformation activity of the protein (Stone et al . , 1987). By 
-analogy , this -port-ion -o£- -r^eg i on - 2 ..-of --VRN 1 jrtay-^simi 1 ar-ly- serve 



10 as a linker region of no great importance to VRN1 function. 

Alternatively, region 2 may function as a novel type of 
transcriptional activation or repression domain, or in some 
other, unknown, function of VRN1 . Table 4 gives information on 
the sequences which were used in comparisons with VRN1 . The 

15 RTVl gene is discussed below. 

(B) Effect of the allelic mutations on VRN1 

The mutations found in the two mutant alleles of vrnl (Figure 
20 6) and the effect of these mutations on the resulting encoded 

protein can be correlated with the phenotypic severity, i.e., 
effect on vernalization response (Figure 1A) , of the two 
alleles. As shown in Figure 7, the vrnl-1 allele encodes a 
polypeptide of only 47 aa, and the vrnl-2 allele encodes a 
25 polypeptide of 194 aa (the last six of which are incorrect due 

to a frameshift) compared to 341 aa for the wild-type protein. 
The fact that the polypeptide encoded by vrnl -2 contains the 
first B3 domain as well as the putative PEST regions and NLS 
but is only slightly less severe in its effect on vernalization 

*0 than the vrnl-1 allele (Figure 1A) , suggests that the second B3 

DNA-binding domain may be required (but not necessarily 
sufficient) for VRN1 function under the conditions used. 



(C) The RTVl gene, a relative of VRN1 



Despite the presence of many plant proteins that contain the B3 
domain, only one putative protein sequence has been found with 
a domain structure identical to VRN1 , i.e., containing regions 
1-3 in the same configuration and with no additional domains. 
40 The gene encoding this protein, which is represented in the 

Arahidopsis EST database (Table 4) , has been named RTVl 
(related to VRN1 ) . .RTVl, which is on IGF BAC clone F13F21 on 
chromosome 1, encodes a protein of 3 01 aa which is very similar 
to VRN1 (Figure 9 and Table 5) . While the overall similarity 




between RTV1 and VRN1 is 74% (within the coding region) , the 
similarity is greatest at the C-terminal end, with region 3 of 
RTV1 and VRN1 being 99% similar (Figure 9 and Table 5) . Outside 
of the coding region (i.e., in the UTRs , promoter region and 
5 introns) , the VRN1 and RTV1 genes appear to be unrelated. 

However, the intron/exon organization of the RTV1 gene is 
similar to VRN1 and therefore the two genes are likely to be 
the result of a duplication event. The most notable difference 

— ire fcvrcen-^V RNl and RTV1 — is t he d e l e tion of 3 3 — amino aci ds i n th e 

10 first B3 domain of RTV1 (Figure 9) . It is worth noting that 

this deletion does not affect the C-terminal, most conserved, 
portion of the B3 domain (Figure 9) . 

The finding of a gene that is very closely related to VRN1 
15 suggests that RTV1 may serve a function in vernalization 

response or other aspect of flowering time control. Since the 
vrnl-1 allele encodes a short polypeptide of only 47 amino 
acids, with no complete putative DNA-binding domain (Figure 7) 
it is likely to encode a non-functional polypeptide. The fact 
20 that vrnl-1 mutant plants still retain a small response to 

vernalization (Figure 1) suggests the presence of other 
Arabidopsis factors that can partially substitute for VRN1 
function. Since RTV1 is a closely-related VRN1 paralogue in 
Arabidopsis it may be such a factor. 

25 

Another factor that may possibly be responsible for this 
functional redundancy is VRN2 (Chandler et al . , 1996). Like 
vrnl-1 fca-1 mutants, vrn2-l fca-1 mutants also retain a 
partial response to vernalization, but vrnl-1 vrn2-l fca-1 
3 0 triple mutants do not (data not shown) . If it is assumed that 

vrnl-1 is a "null" mutation, then this result suggests that 
VRN1 and VRN2 act in separate and partially redundant 
vernalization-promoting pathways . 

3 5 Example 9 - Detection and isolation of VRN1 - related genes from 

other plant species 

A high-stringency Southern blot of genomic DNA from various 
cereals, when hybridized with a probe corresponding to the VRN1 
40 transcript, specifically detected VRIV2- related genes in millet, 

sorghum, barley, rice, wheat, and maize (see Figure 9). 



To prepare the blot, approximately 10 jig of genomic DNA from 
each of these varieties was digested with Eco RI (37°C, 
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overnight) . DNA samples were separated by gel electrophoresis 
on a 0.8% agarose gel run at 50V for 16 hours. The gel was 
then processed for Southern blotting by standard procedures 
(see Maniatis, supra) and DNA was blotted overnight onto a 
nylon membrane (Hybond-N, Amersham) . Following blotting, the 
DNA was cross-linked to the filter by exposure to UV light 
according to the manufacturer's recommendations and baked at 
80°C for 2 hrs. 



10 The VRN1 cDNA probe V2V6 was prepared by amplifying an aliquot 

of the first-strand cDNA synthesis from total RNA of 
Arabidopsis seedlings with the oligonucleotide primers V2 and 
V6 . The resulting PCR product was purified by agarose gel 
electrophoresis and labeled with 32 P-dCT? by the random hexamer 
^5 priming method (see Maniatis, supra). 

Hybridisation of the filter with the radiolabel led probe, and 
subsequent washes, were under standard high stringency 
conditions using buffer comprising 5 x SSC, 5 X Denhart 1 s 
20 solution, and 0.5% SDS at 65°C for 16 hours. The filter was 

then washed sequentially in (1) 2XSSC, 0.1% SDS at room 
temperature for 10 minutes; (2) 1XSSC, 0.1% SDS at 65°C for 15 
minutes; and (3) 0.1XSSC, 0.1% SDS at 65°C for 10 minutes. 

25 The washed filter was exposed to a Phosphorlmager plate 

(Molecular Dynamics) for 3 days prior to visualisation. 

In the light of the results above, in addition to the monocots, 
it is highly probable that ViWl-related genes will be found to 
£p exist in agronomically important dicot species (e.g. 

Brassicaceae , sugarbeet, peas and celery etc.) 

Thus the provision of sequence information for the VRN1 gene of 
Arabidopsis thaliana enables the obtention of homologous 

35 sequences from cDNA or genomic libraries from other plant 

species, such as can be prepared or obtained by the skilled 
person without undue burden. Positive clones can be further 
analyzed by restriction endonuclease digestion and Southern 
blotting as described hereinbefore. Particularly preferred are 

40 homologues from commercially important species that have a 

vernalization requirement, or show some response to 
vernalization . 



Materials and Methods used in Examples 



Plant growth 
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For vernalization treatments, seeds were sown on fine grit 
(Levington's M3 ) in individual pots, and germinated for 
5 increasing durations at 4°C, 8hr light :16hr dark, 5 mmol m" 2 
sec" 1 light intensity. For dose-response experiments seed 
sowing was staggered, with all plants removed from the 
vernalization conditions simultaneously. Following 
ye ma 1 i za t i on , — seedlings were plac -e d in to a control l ed 

10 environment chamber (Gallenkamp) , 20°C, 16 hr light: 8hr dark 

90 mmol m~ 2 sec" 1 light intensity. Seedlings receiving no 
vernalization treatment were stratified for 2 days under 
vernalization conditions, and grown for two days prior to 
transfer in to the growth cabinet. Plants were grown for 10 

15 days, and then pricked out into individual compartments of P4 0 

trays. Flowering time was measured by counting total leaf 
number (i.e. rosette and cauline leaves) by marking the leaves 
with permanent black ink as they emerged. 

2 0 Genetic mapping 

VRN1 was initially positioned on Chromosome 3 through linkage 
to RFLP markers mi339 and mi207 (Liu et al . , 1996), in F2 
progeny (154 chromosomes) of across between vrnl-1 fca-1 (Ler 

2 5 background) and fca-10 (Ws background) , as described in 

(Chandler et al . , 1996). As a first step in refining this map 
position, two existing RLFP markers in the region (g4711 and 
m560B2; Chang et al . , 1988), and two existing SSLP markers in 
the region (ATHCHIB and ngal62; Bell and Ecker, 1994), were 

30 scored on a larger F2 population (988 chromosomes) of the same 

cross as above. In order to refine the map position of VRN1 
further, new genetic markers that were polymorphic between Ler 
and Ws were developed (Table 1). Standard techniques (e.g., 
restriction digestion, 32 P-labeling of probes, agarose gel 

35 electrophoresis, Southern blotting, and Phosphorlmager 

detection) were used throughout. 

Physical mapping 

40 YAC, BAC and cosmid clones and libraries were handled, 

analyzed, and hybridized according to standard procedures 
(Schmidt and Dean, 1995; Bent et al . , 1998; Macknight et al . , 
1997) . As with the genetic mapping of VRN1 , some probes and PCR 
markers were existing and available, and some were developed in 
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order to establish or refine the overlap between clones. The 
following probes and PCR markers were existing and available: 
mi289, GBGe303, MSH2 , ve039, 011339, agpl4, MAP2K, sAT2105b, and 
m506B2. New probes and PCR markers developed in order to 
5 identify the VRN1 gene are listed in Table 2. New probes and 

PCR markers were developed by three methods: (1) iPCR of BAC 
ends, (2) design of PCR primers based on BAC end sequence data 
(from TIGR; http://www.tigr.org/tdb/at/atgenome/ 

bac _end : _3 e Qr €fe/ bQC_ o nd_G earch 4itml ) , — and — (-3-) — sequencing of — 

10 cosmid ends and design of PCR primers based on the obtained 

data . 



(A) iPCR of pBelo-BAC ends 



15 The following procedure is a modification of a protocol 

£ received from T. Altmann (MPI, Golm, Germany) . DNA from l/10th 

of a 25 ml BAC overnight culture was digested with (1) Hhal or 
EcoRI or Hindi or Rsal for the T7 end, or (2) Hhal or Haell or 
EcoRV for the Sp6 end, and phenol chloroform extracted and 

20 ethanol precipitated. Digested material was ligated in a 100 fil 

standard reaction with T4 DNA ligase, heat inactivated, and 
ethanol precipitated. Ligation products were digested with Pvul 
for the T7 end, and BsrBI for the Sp6 end in a 15 fil reaction 
volume. For PCR, 1 fil of digestion reaction was amplified in a 

25 standard reaction using (1) primers BACT7U and BACT7L for the 

T7 end, or (2) primers Sp6A and Sp6B for the Sp6 end. 

BACT7U 5 1 - CCTCTTCGCTATTACGCCAG - 3 ' 

BACT7L 5'- GCCCTTCCCAACAGTTCG -3 1 

30 Sp6A 5'- CACACAGGAAACAGCTAT -3 1 

W Sp6B 5*- ACACAACATACGAGCCGGAA -3' 

(B) Sequencing of cosmid DNA and PCR products 

35 Genomic sequence was obtained from the ends of cosmid insert 

DNA using the BIGDYE cycle sequencing kit (Perkin Elmer Applied 
Biosystems) , and T3 and T7 primers, whose sequences flank the 
genomic DNA insert site. For sequencing regions further into 
the cosmid insert DNA, and for sequencing PCR products 

40 amplified off of genomic DNA from the vrnl-1 and vrnl-2 

alleles, the oligonucleotides shown in Table 3 were used. The 
reactions were run on an ABI377 machine, and compiled using the 
SeqMan (DNAStar, Lasergene) program. 
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Cosmi d compl em en ta t i on 
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Cosmids in the 04541 binary vector were mobilized into 
Agrobacterium tumefaciens (strain C58C1 Rif R ) by tri-parental 
mating (Hoekema et al . , 1983). \rrnl-l fca-1 plants were 
transformed with these Agrobacteri um strains by root infection 
5 (Hooykaas, 1989). Transgenic plants were selected on GM with 
Kanamycin (50 mg/mL) , and transferred to soil when they had 
reached the 3-4 leaf stage. The presence of each cosmid in the 
transgenic lines was confirmed using a specific diagnostic PCR 
^eartion) — usd-ng^- a- prim e r -prese nt within th e cosmid -inse rt 

10 sequence and a primer present in the cosmid flanking the 

insert site. T2 seed were collected, and analyzed for the 
segregation of Kanamycin resistance or sensitivity on GM plates 
containing Kanamycin (as above), scored 14-20 days after 
germination. Lines that segregated a 3:1 ratio of resistant tc 

15 sensitive plants were tested for their ability to complement 

the vrnl-l mutant phenotype, by vernalizing for 5 weeks and 
recording the total leaf number. 



20 



RT-PCR and 3 ' -RACE 



In order to determine the intron-exon structure of the VRN1 
gene, RT-PCR reactions using total RNA prepared from fca-1 and 
vrnl-1 fca-1 seedlings grown on soil were performed according 
to standard procedures (Frohman et al . , 1988) . The PCR products 

2 5 were sequenced using both the primers used for PCR, and 

selected internal primers, using the BIGDYE kit (PE Applied 
Biosystems) . The reactions were run on an ABI377 machine, and 
compiled using the SeqMan (DNAStar, Lasergene) program. 

3 0 Sequence comparisons 

The nucleic acid sequence comparison in Table 5 was by using 
the Jotun Hein method (weighted residue table) of MegAlign 
(DNAstar) . Genomic and cDNA sequences were aligned using the 
3 5 BLAST 2 SEQUENCES program 

(http : //www.ncbi .nlm.nih.gov/gorf/bl2 .html) from NCBI . 

Parameters are preferably set, using the defaults, as follows: 

40 Gap penalty: 11 

Gap length penalty: 3 
KTUP word length: 6 



Amino acid sequences were initially aligned using the Clustal 
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method, using the PAM 250 residue weight table, and further 
adjusted manually. For amino acid similarity comparisons, 
amino acids were grouped into five classes on the basis of 
physiochemical properties, as follows: (1) hydrophobic - G, A, 
5 V, P, M, I, L; (2) polar - S, T, N, Q, C; (3) bulky ring - Y, 

F, W, H; (4) positively charged - K, R; (5) negatively charged 
- D, E. 
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Table 1. Genetic markers developed in order to identify the 
VRN1 gene . 





Marker 


Type 


Ler/Ws polymorphism 


5 


GBGe3 03 


RFLP 


Sspl; Ler band < Ws band 




MSH2 


CAPS 


Sau3A; Ler: 2 sites, Ws: 
no sites 

Bfal; Ler: no sites, Ws : 1 








site 




ve039 


CAPS 


Rsal ; Ler: 3 sites, Ws: 2 
sites 




agpl4 


RFLP 


Hpall; Ler band < Ws band 




pKS1240 


RFLP 


Dral ; Ler band > Ws band 


10 


MAP2K 


RFLP 


Haelll; Ler band < Ws band 
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Table 2. Physical mapping markers developed in order to 
identify the VRN1 gene. 



Marker 



Type 



Method used to develope 



10 



15 



20 



T4L24-T7 
T7H5-Sp6 
T15C16-Sp6 



T10N5-T7 
T24F13-T7 

8H8 -T7 

F25C7-T7 

10F10-T3 

8H8-T3 

F18G1-T7 

10F10-T7 

T4L24-Sp6 

T2 0A21-Sp6 

T7H5-T7 

T15C16-T7 

F10N5-Sp6 

T24F13-Sp6 

F28N8-Sp6 

F5G10-T7 



Southern probe iPCR 
Southern probe iPCR 
Southern probe iPCR 



Southern probe 
PCR diagnostic 

PCR diagnostic 

PCR diagnostic 

PCR diagnostic 

PCR diagnostic 

PCR di agnos t i c 

PCR diagnostic 

PCR diagnostic 

Southern probe 
Southern probe 
Southern probe 
Southern probe 
Southern probe 
PCR diagnostic 

PCR diagnostic 



~±¥CR 

BAC end sequence data 
(public) 

cosmid end sequence 
(obtained) 

BAC end sequence data 
(public) 

cosmid end sequence 
(obtained) 

cosmid end sequence 
(obtained) 

BAC end sequence data 
(public) 

cosmid end sequence 
(obtained) 

BAC end sequence data 
(public) 
iPCR 
iPCR 
iPCR 
iPCR 
iPCR 

BAC end sequence data 
(public) 

BAC end sequence data 
(public) 



25 
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Table 3. Oligonucleotides developed to identify the VRN1 gene 
The positive ( + ) strand oligos correspond to the forward, or 
mRNA, strand of DNA, and the negative (-) strand oligos 
correspond to the reverse, or coding, strand of DNA. The 
5 position indicated in the table refers to the nucleotide 

position in the VRN1 genomic sequence (Annex I) of the 5' end 
of the oligo. 



ulig o St r a 'fosici bequ e nee to J 1 ) 

nd on 





Q 9 
O D J 


+ 


O 3 U 


paa ppptt a. en en a aar 

Lrtrtbbb 1 1 rtbbbLrtrtrtb 




c; & a 




ft £ £ 
ODD 


bill bbbb lrt.rt.LLb lib 




VI 1 

V 11 


+ 


1 1 Q-) 
1 1 ? J 


b/ibHLLHb 1 1 1 Ibl 1 1 ILL 




Q & 9 
O O *L 




1 9 9 Q 
X Z. Z. z) 


pAfA A ATATATPTPPA A App 
brtb/-Lrt_rt. 1 rt. 1 rtbb 1 bbrtJ-irt.bb 




ODO 


+ 


_L f± ± 


A A APPPPA PTAPPTPPP 
rtrt/\bbbbrtb 1 rtbb 1 bbb 


-L D 


V / 


+ 


± O _L 1 


L1L1L1 bb 1V_.11L1L.11L 




V1U 




± O Z O 


PA APAPA APAPPAPAPAP 




V D 


+ 


10 0 9 

± -7 U / 


TTTTPTPA TPPA PTATPP 
1 1 1 1 L 1 brt. 1 L Lrt.L 1/ilLL 




o 3 J_ 




i q 9 n 

1 JjU 


T T T P T TP P A T A P TPP A T P A P 
1 1 1 L 1 1 bb-H. 1 rtb 1 bbrt. 1 brt.b 




Q £ R 

ODD 




_L O o 


rt>\rtrt.brtbbbrtH.brt.b 1 rt-rtb.H»rt.b 


9 n 


c c 9 

O 3 ^ 




9 9 7 0 


1 1 bb 1 Iblbl 1 1 bb 1 bbb 




V 3 




9 cr Q Q 

3 -7 -7 


ppt p t p t a t r9i t a t t p tp p 

bb iLlLlHlulnl Ibl bb 




v *i 




<C U 1 o 


Ppapaz\T2\P2\TAPAPAPP 

VJ7V_rt.V_.rtrt. X .rt.v_.rt. J. rt.VJ7rt.Ort.Vw V- 




V12 




2846 


AGATTGATTACACGACTCC 




V8 


+ 


3125 


CCCAGATAAGTTTGTGAG 


25 


V3 


+ 


3391 


ATTCCGCTCACAACCAC 




V15 




3414 


GTTTGAAGTGGTTGTGAG 




V14 


+ 


3477 


TACCCATCACCACTTCC 




S60 




3474 


CAGAAGAAGGAAAGATGACC 




S61 


+ 


3927 


GAAGAAAGAGAGAGAGCC 


30 


V13 


+ 


3976 


ACCCTTTCTTCAGAGTG 




V9 




3942 


CTCTCTCTCTTTCTTCTG 




VI 6 




3993 


CCACTCTGAAGAAAGGG 




S46 


+ 


4096 


CCTTCTGTTTCTGTTTCTC 




S45 




4114 


GAGAAACAGAAACAGAAGG 


35 


V2 




4431 


AAGATACTCCTACACGAC 




VI 7 


+ 


4486 


GTCTCGTTTTTTCTCTCGG 




S49 


+ 


4870 


CTACCACAGTTCCCACCTAC 
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Table 4 Sequences corresponding to ESTs forVRNl and RTV1 , and 
sequences used for comparison to VRN1 in Figures 8-12 



Name 



Type Description 



Accession 
# 



92M2 
F2H7 



105022 
151H18 



89H14 



89123 



VRN1 



RTV1 



RAVI 

FUSCA3 

ABI3 

ARF1 

IAA24 

c-MYC 



nucl EST; VRN1 transcript T21005 



nucl EST; VRN1 transcript N95889 



nucl EST; RTV1 transcript 



nucl EST; RTV1 transcript 



247A13 nucl EST; RTV1 transcript 



nucl EST; RTV1 transcript 



nucl EST; RTV1 transcript 



aa 



aa 



3859591 aa 



CAA19759 aa 



CAA19755 aa 



CAA19754 aa 



aa 



aa 



aa 



aa 



aa 



aa 



Encoded by VRN1 

(putative) 
Encoded by RTV1 

(putative) 

Putative Arabidopsi s 
protein 

Putative Arabidopsi s 
protein 

Put a t i ve Arajbi dopsi s 
protein 

Putative Arabidopsi s 
protein 

Putative Arabidopsi s 
protein 

Put at i ve AraJbi dopsi s 
protein 

Putative Arabidopsi s 
protein 

Putative Arabidopsi s 
protein 

Put at i ve Arajbi dopsi s 
protein 

Putative Carassius 
auratus (goldfish) 
protein 



T22671 

T76788 

AA713228 

T20909 

T2 0 917 

N/A 

N/A 

AAC72857 

CAA19759 

CAA19755 

CAA19754 

BAA34250 

AAC35246 

JQ1676 

AAD39318 

AAB92476 

P49709 
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Table 5 Comparison of the Nucelotide and Amino Acid Sequences 
of RTV1 to VRN1 









Nucleotide 
s 


Amino 
Acids 


5 


Sequence 


Id 
a 


Length 
(bp) 


Range* 3 


Id 
a 


Sim. 
c 


Length 
(aa) 


Ranged 




RTV1 


69 


1026 


269- 


67 


74 


341 


1-341 




complete 
RTV1 Region 
1 


49 


283 


1291 
272-550 


42 


44 


93 


2-94 


10 


RTV1 Region 
2 


71 


429 


551-981 


71 


82 


144 


95-238 




RTV1 Region 
3 


84 


314 


982- 
1291 


95 


99 


103 


239-341 



15 a Identity (%) 

b Numbered relative to VRN1 transcript sequence (Figure 7) 
c Similarity (%) 

d Numbered relative to VRN1 encoded amino acid sequence 
(Figure 7) 
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Annex I - her VRN1 genomic (contig 29 [1501-65001) 

1 10 20 30 40 
TTTAAAATTCGAATTGGGATTTJ\AGAAAAATTCTCATCAA 
5 ATATTTATCATTAGTGTATATATATCAGTGTTTTACATTT 
GTTAATCCTAAATAATAAACCGATCTGAAAAGTTGATAAA 
TGCGTTGTCAAAAGACAAAATATACATCCAAACAAATCAC 
GTGATTGCCTTCAACTTGCCACgGGTTCAAAGATTTAACA 
AATCTTCTAAAACACCA ACTTAACCCACGAATACACAAGC 

1 0 ACAGAGTGGTGGTAAACATACAAGTTAATGAGTTATTCAA 
ATGAGATTTTCAATATCATTCTTCTTCAGCCCGTCACAAG 
AAGCCAAGATTAAGCCATTAGAGGAAGTTTATAAACCGAC 
AAAACCTGCTTAGATACAAAGAATACTAGCTAATGTGTTT 
CAACAAACTTCAAATTGACGATACGTTACATTCATATTAA 

1 5 TCACTTCAGAGCTTGATTATTCAAATTATTTTTTCTACTG 
TGATACATATATACACACATGTTTTGCTTTTCTATGATTC 
TATCTACATTTTCATACCGTTGAATAATTTATGTATGAAT 
TACGATGCAATTTCCTTCATTATGCTTGAATAAAATGCTT 
TTGGACATGCATGCGATATTGGATCTACTTTTGGATTCTA 

2 0 TTTTTAAAAATCAGCGAGTTTGTTGCTTTGTAATTTTTAA 
TTAGGCATCAAGAATTTCTAAAATGCACGCGAACTGGTGA 
AAAGAGGAATGTTTACGTTTACCCCTTTATTTTCTTACAG 
CTCATAAGGATACTGTCAGAAGACAGAACCAAGGCTCTCT 
GACTATAAATTGGAATCCATTTAAACATAATGTTATGACC 

2 5 AATGATGGCCAACGGTTAGCCCAAACTAATTAACTACAAG 

TCAAGTTCCAATATTCTAAGGAGAAATAATAGTATACTAA 
ACATACATTAGAGAGGTTAAACTTCTTTTTGGATTTAAGT 
GTGTATGCATAGGCTATTTATTCTTAAGTATAACTATTAA 
CTGTAGCTAGATTTATACAAGAAATACATAAAACTTTATG 

3 0 CATGTGAGGTAGCCATGAATATACGTACATGTTGCAATCG 

ATTATACATGTTGTATTTGGATTTCTCTATACATGTTTTA 
ACTTGTCATTCTCTAAGTATATACATACCATTAATACTGT 
GGGCATGAGTTTATGATAAGACTTTTCTTTTGGAGACCAG 
TTTTGTTTTCCTTTCCACCTATATTTGTCTATAGGCTTCa 

3 5 gACGGTACACTAGTTTACAAGTGTTTTTATATGTTCTAAA 

TAAAATTGAGATTTTCCGGAACGGTATGATCTGTTTGCAA 
ATAAGGACGTATATATAACAGTATCAAATATATTTGTTGT 
TATAAGGCAATAATATATTTTCTGAGATATTGCGTGTTAC 
AAAAAAGAAATATTTGTTAAGAAAAAAAAAGATGGTCGAA 

4 0 AAAGGGGAGTAGGTGGGGGCGGTCGGCTTTTGATTAGTTA 

ATAAAAGAAACCACACGAGTGACCTACCGATTCGACTCAA 
CGAGTCTACCGAGCTAACACAGATTCAACTCGCTCGAGCT 
TCGTTTTATGACAAGTTGGTTTTTTTTTTTTTTTTTTAAT 
TTTTTCATCTTCTTGGGTTTGGTTGGGTCACTCTTCAGGT 
4 5 CAGGTGTGTAAAAAAGAAAGAAAGAAAAGAGAGATTGTTG 



TGTTGTAACCCCTTTGACTAAAATCTAATGAACTTTTTTA 
ACACAACAAAACTCCTTCAGATCTGAAAGGGTTCTTCTTC 
TCTCTTAGTCTCTTTGTCCTTTTATTCTCCGTCGTCGTTT 
CATGATCTGACTCTCTGGTCTTCTCTTCTTCTTCTTCTTC 
TTCTATTTTTTCTTACTTCGTCACTGTTGTGTCTGAACAT 
GCCACGCCCTTTCTTCCATAAGTTGATTTTCTCATCCACT 
ATCCAAGAAAAACGTCTGGTAACTTACTCTCTCTCTCTCT 
CTCTCTCTCTGTTCTCCTTCTCCTCATCTTTCAAAGTTTT 
. GATTTTGTGCGA A ATTGAGGGTTTTCAAGGTTTGGAATCT 
GGTGAACGAGTTTGTAAGATTATGCCTTGTGACACTCTTG 
CTTGATTTCTTACAATTCACTTGTATTGATTCTTTGTAAG 
AATCGAGTCAAGGTTGTGCTTTTATCTTCTTACTCTTCCC 
TGTTTTGGGTAATGAAAAGAAGTTCCATTTTTGAACTTTG 
TGTTGTCTTATTGGTCAAATGAGAATTTGTGGGTTTCCAA 
TGGAAGTCTGCAAGACAGTTTCTTTTGGTCATTGGTTGTG 
TTTGGTGGGAAATTGGGTATTTGATGGTATATCTGTACTC 
TGACAGCATATTGTGTGTAGTTTGGGAATTTTTTTTTTTT 
TTTTGAGTGATTTGACTTTTGGAGGACGATTTGATTCTGT 
CAGATTGATCAAATTTCTTCTGAGGAGAAAAAGTTGAGAT 
CTGTTTATGGTTTCTCTATTATAAATGTCTGTTTTGTTTA 
CTCTATTTTGACTGTTTTCTCTGTTTGACTTAGGAATGTC 
TGAGATCTTAGACTCCTTATTGAGTATTGTGtGGCTTGTG 
AGTGAATCCCTAAAACTGAGTAGTTGACTTGTTTTGAAGG 
TCTCTATGTATTGTGCTTATGTTTTAAAGTTGTCTACTTT 
ATTTGATACAGTGATTAGTCATCACTTGTACAGATTCCCC 
CAAGAGCATTGTTTTGAACAAATCCAAATTTGCTTAGCTC 
TCCATTTGGCATTTAAGTGACTAGATTTTCTCTGGAATAA 
TGATTTCGATTAACACAGGCATTTATGTGGAACCAAGTTT 
GCAAATTATTAATGTGATAAGATCATAGGAGTCGTGTAAT 
CAATCTATTCAGAGATAAATGTACCATTTTACATGTGTAC 
TAATGGACTGTGTCTCCTTGTTGATGCCTTCTCTAAACTG 
AAATATGGCCTTTTGGTTTGTGTTTTTAAATTAGGTAAAG 
CCGTCGTTTCTTCAGCTACTGTGTTTATTGGATGTTTTTG 
CTGAAAAATGTCTGTTTCGATTTGATGTTCTCGCAATATT 
CTGTGCTGTTCTTATAGATATTGTGGACATTTATATCATT 
ATATGCTTCTTTATATCTCATACCGGCATGCTTGTGCAGA 
GGGTCCCAGATAAGTTTGTGAGTAAATTCAAGGATGAGCT 
TTCGGTTGCTGTTGCACTCACAGTACCTGATGGTCATGTT 
TGRCGTGTAGGACTAAGGAAAGCTGACAACAAAATTTGGT 
TTCAAGATGGTTGGCAAGAGTTTGTTGACCGTTACTCCAT 
TCGCATTGGTTATCTTTTGATTTTTAGATATGAAGGAAAC 
TCTGCCTTCAGCGTCTACATTTTCAATTTATCCCACTCTG 
AGATCAATTACCATTCCACCGGTCTCATGGATTCCGCTCA 
CAACCACTTCAAACGCGCCCGTTTGTTTGAAGACCTTGAA 
GATGAAGATGCCGAGGTCATCTTTCCTTCTTCTGTGTACC 
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CATCACCACTTCCTGAGTCTACAGTACCAGCCAACAAAGG 
GTATGCTAGTTCAGCCATCCAAACCTTGTTCACTGGACCA 
GTTAAAGGTGATATTTATAACCAACTGATTCCCTTTATCT 
ATCGCTGATTACGCGTCTTATCATTCTTTTGAGGTTGATG 
5 CTTGATATTTTCCTTATCTCCAGCTGAAGAGCCAACGCCA 
AC C C C AAAAATAC C TAAAAAGAG AGGGAGG AAGAAG a AAA 
ATGCTGATCCTGGTAAGCACTTTTCCTCTTTGAAATGCTT 
CAGACTCGTTTTCAGAGGATTCACAGATTCTTCCTCATGA 

TACATATATCCTTTTGATATT.GTC CT TACAGAGGAAATAA 

1 0 ACTCATCAGCTCCGCGAGATGATGATCCAGAGAACCGTTC 
AAAGTTCTACGAGAGTGCTTCTGCGAGAAAGAGAACCGTG 
ACTGCAGAAGAAAGAGAGAGAGCCATCAATGCAGCCAAAA 
CGTTCGAACCAACAAACCCTTTCTTCAGAGTGGTTCTGCG 
ACCATCCTATCTATACAGAGGTTGCATCATGGTAATAAAA 

1 5 AAACATCTTAGGAAGACTTAATCTTATCGGTGTCTTCACT 
GATCTCTAAAAGAAGCCTTCTGTTTCTGTTTCTCTCAACA 
GTATCTTCCTTCTGGGTTTGCTGAGAAGTACCTAAGTGGG 
ATCTCCGGGTTCATCAAAGTCCAGCTTGCGGAGAAACAAT 
GGCCTGTTCGATGTCTCTACAAAGCCGGGAGAGCCAAATT 

2 0 CAGTCAAGGATGGTACGAATTCACTCTAGAGAACAACTTA 
GGAGAAGGAGACGTCTGTGTGTTTGAGCTGCTCAGAACCA 
GAGATTTCGTTTTGAAAGTGACAGCCTTTCGAGTCAACGA 
GTACGTCTGAACAAAGCATTATGGTGTGATCATTCTGGAT 
TTGCAAGTACAATGTCGTGTAGGAGTATCTTAATTTAAAA 

2 5 ACAACTAAAAAACTCTCTTCTGGTCTGTGTCATTATTGCG 

TCAGTGTCTCGTTTTTTCTCTCGGGTTTACTTTGTGTTAT 
CGATGTGGATAAGTTGTTTTTACCTCATTATATATAACCT 
CTTGAGTGGAACTCAAATTGTTTGAGTAGAACAAACAAAG 
TTAGGGTTTAAGAAGAAGTCTGTAAATACCTAATCTCCAT 

3 0 CAAATTTGAGTAGAAAGACAAACTGTTCTGGTGGAATACA 

AGGAGGGAACTTGAGATAACAAACTTAAGAATAGCCTTCA 
AGCCAACGTCTAGAATTTGATGAAGTTGTTGTTTGATCAC 
CTCTGAGATAATTGGAAACCCTCTTCATGCAGTTTGCTTG 
AGGATACTGGTGAAAATGGGAGTATTGAAGGAAAATGCAT. 
3 5 ATATAAGATTGTAGGTGGGAACTGTGGTAGCAGACACAAC 
ACTTGTTCTCTAGACATATACTGTACCAGACATGTTTTGA 
TCATAAAACTTAAAAAAAAGAAAACCGTGTGTAAATCAAG 
CAAGGAACAACTACAATATTACAATCTTATTGAGATATCA 
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Figure 3 
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Figure 4. 
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Figure 7 




TCTTGGGTTTGGTTGGGTCACTCTTCAGGTCAGGTGTGTAAAA 4 3 

AAG AAAG AAAG AAAAG A G AG ATTG TTG TG TTG T AAC C C C TTTG AC T AAAATC T AATG AAC TTTTTT AA C AC AAC A 118 

AAACTCCTTCAGATCTGAAAGGGTTCTTCTTCTCTCTTAGTCTCTTTGTCCTTTTATTCTCCG 193 

GATCTGACTCTCTGGTCTTCTCTTCTTCTTCTTCTTCTTCTATTTTTTCTTAC 2 68 

▼ 

ATGCCACGCCCTTTCTTCCATAAGTTGATTTTCTCATCCACTATCCAAGAAAAACGTCTGAGGGTCCCAGATAAG 343 

M PRPFFHKLIFSSTIQEKRLRVPDK 2 5 

tga in vm1-1 

TTTGTG AGTAAATTCAAGGATGAGCTTTCGGTTGCTGTTGCACTCACAGTACCTGATGGT^ 418 

FVSKFKDELSVAV ALTVPDGHV (w) R V 50 



GGACTAAGGAAAGCTGACAACAAAATTTGGTTTCAAGATGGTTGGCAAGAGTTTGTTGACCGTTACTCCATTCGC 4 93 

GLRKADNKIWFQDGWQEFVDRYSIR 75 

ATTGGTT ATCTTTTG ATTTTTAG AT ATG AAGG AAACT CTG C C TTC AG C G TC T AC ATTTTC AATTTATC C C ACTCT 568 

IGYLLIFRYEGNSAF SVYI F N L S H S 100 



GAGATCAATTACCATTCCACCGGTCTCATGGATTCCGCTCACAACCACTTCAAACGCGCCCGTTTGTTTGA a .GAC 6 43 

EINYHSTGLMDSAHNHFKRA R L F E D 125 

CTTGAAGATGAAGATGCCGAGGTCATCTTTCCTTCTTCTGTGTACCCATCACCACTTC 718 

LEDEDAEVIFPSSVYPSPLPESTVP 150 

GCCAACAAAGGGTATGCTAGTTCAGCCATCCAAACCTTGTTCACTGGACCAGTTAAAGCTGAAGAGCCAACGCCA 793 

A N K GYASSAIQTLFTGPV K A E E P T P 175 

G-A in vrn1~2 ▼ 

ACCCCAAAAATA CCTAAAAAGAGAGGGAGGAAGAAGAAAAATG CTGATCCTGAGGAAATAAACTCATCAGCTCCG 8 68 

T P K I [PKKRGRKKK (n)| ADPEEINSSAP 200 

CGAGATGATGATCCAGAGAACCGTTCAAAGTTCTACGAGAGTGCTTCTGCGAGAAAGAGAACCGTGACTGCAGAA 943 

RDDDPENRSKFYESASARKRTVTAE 225 



GAAAGAGAGAGAGCCATCAATGCAGCCAAAACGT^CGAACCAACAAACCCTTTCTTCAGAGTGGTTCTGCGACCA 1018 

ERERAINAAKTFE PTNPFFRVVLRP 250 

TCCTATCTATACAGAGGTTGCATCATGTATCTTCCTTCTGGGTTTGCTGAGA^ 1093 

SYLYRGCIMYLPSGFAEKYLSGISG 27 5 

TTC ATC AAAGTC C AGCTTGCGG AG AAAC AATGG C CTGTTCG ATG TC TC TAC AAAGCC GGG AG AG CC AAATTC AGT 1168 

FIKVQLAEKQWPVRCLYKAGRAKFS 3 00 

CAAGGATGGTACGAATTCACTCTAGAGAACAACTTAGGAGAAGGAG 1243 

QGWYEFTLENNLGEGDVCVFELLRT 325 

AGAGATTTCGTTTTG AAAGTGACAGCCTTTCG AGTC AACGAGTACGTCTCAAC AAAGC ATTATGGTGTGATC ATT 1318 

R D F V L K V TAFRVNEYV . 341 



CTGGATTTGCAAGTACAATGTCGTGTAGGAGTATCTTAATTTAAAAACAACTAAAAAACTCTCT^ 13 93 

TCATTATTGCGTCAGTGTCTCGTTTTTTCTCTCGGGTTTACTTT 14 68 

TCATTATATATAACCTCTTGAGTGGAA 14 9 5 



NLS (boxed), B3 DNA-binding domains (underlined), 
PEST regions (doubly underlined), protein kinase C phosphorylation site (asterisks). The 
positions of introns are indicated with arrowheads. The positions of the mutations in vrn1-1 and 
vrn1-2are circled. 
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Figure 8 
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